0 




GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein 



nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:30:21 



Search time 1723.65 Seconds 

(without alignments) 

3963.787 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-32 
705 

1 QAS G EAGVS C L REN FAVY S V 



ESEVAISEELVQKYSNSALG 141 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



4708233 seqs, 24227607955 residues 



Total number of hits satisfying chosen parameters: 



9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=f rame+j>2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14 042005_094842_1834 8/app_query. fasta_l 
.3683 

-DB=GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN- 15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_19954_@runat_14042005_094842_18348 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -^WAIT -DSPBLOCK=100 ■ -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



i 
f 



1: 
2: 
3: 
4: 
5: 
6: 
7: 
8: 
9: 
10 



gb_ba : * 
gb_htg : * 
gb_in : * 
gb_om: * 
gb_ov: * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 




11: gb_sts:* 
12: gb_sy:* 
1 3 : gb_un : * 
14: gb_vi:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



Jo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


680.5 


96. 


5 


1568 


10 


RN0242963 


AJ242963 Rattus no 


2 


635.5 


90. 


1 


1769 


10 


AY102283 


AY102283 Mus muscu 


3 


589.5 


83. 


6 


764 


10 


AF051335 


AF051335 Rattus no 


4 


581.5 


82 . 


5 


734 


10 


AF326337 


AF326337 Mus muscu 


5 


572.5 


81. 


2 


1079 


9 


BC007109 


BC007109 Homo sapi 


6 


572.5 


81 . 


2 


1151 


9 


BC001035 


BC001035 Homo sapi 


7 


572.5 


81 . 


2 


1691 


9 


AF132048 


AF13204 8 Homo sapi 


8 


572.5 


81. 


2 


1698 


9 


BC014366 


BC014366 Homo sapi 


9 


572 . 5 


81. 


2 


1700 


9 


AF177332 


AF177332 Homo sapi 


10 


572 . 5 


81. 


2 


1800 


9 


AY102276 


AY102276 Homo sapi 


11 


571.5 


81 . 


1 


1213 


6 


BD194907 


BD194907 86 human 


12 


571.5 


81. 


1 


1213 


6 


CQ855235 


CQ855235 Sequence 


13 


565 


80. 


1 


1721 


4 


AY164744 


AY164744 Bos tauru 


14 


563.5 


79 . 


9 


2162 


9 


AB049853 


AB049853 Macaca fa 


15 


559.5 


79. 


4 


2610 


6 


AR379837 


AR379837 Sequence 


16 


558 


79. 


1 


1617 


9 


AF087901 


AF087901 Homo sapi 


17 


531 


75. 


3 


799 


6 


AR028522 


AR028522 Sequence 


18 


525 


74 . 


5 


994 


6 


BD139293 


BD139293 Extended 


19 


509.5 


72 . 


3 


404 


6 


BD076183 


BD076183 5' EST of 


20 


503 


71 . 


3 


600 


9 


HSA251385 


AJ251385 Homo sapi 


21 


485 


68 . 


8 


1363 


5 


BX933922 


BX933922 Gallus ga 


22 


485 


68 . 


8 


1578 


5 


CR353502 


CR353502 Gallus ga 


23 


482 


68 . 


4 


1597 


5 


AY164737 


AY164737 Gallus ga 


24 


470.5 


DO . 


/ 


639 


6 


AX410838 


AX410838 Sequence 


25 


455 


64. 


5 


4822 


6 


AR220865 


AR220865 Sequence 


26 


447 


63. 


4 


1738 


10 


AB073672 


AB073672 Mus muscu 


27 


447 


63. 


4 


2481 


9 


AF063601 


AF063601 Homo sapi 


28 


447 


63. 


4 


2883 


9 


AF320999 


AF320999 Homo sapi 


29 


447 


63. 


4 


2958 


10 


BC032192 


BC032192 Mus muscu 


30 


447 


63. 


4 


2974 


6 


CQ716296 


CQ716296 Sequence 


31 


447 


63. 


4 


3489 


6 


AX766046 


AX766046 Sequence 


32 


447 


63. 


4 


3491 


9 


AF333336 


AF333336 Homo sapi 


33 


447 


63. 


4 


3492 


6 


CQ829507 


CQ829507 Sequence 


34 


447 


63. 


4 


3576 


6 


AX766050 


AX766050 Sequence 


35 


447 


63. 


4 


3579 


6 


BD249446 


BD249446 Protein s 


36 


447 


63. 


4 


3579 


9 


HSA251383 


AJ251383 Homo sapi 


37 


447 


63. 


4 


3815 


10 


BC032272 


BC032272 Mus muscu 


38 


447 


63. 


4 


3919 


6 


CQ829486 


CQ829486 Sequence 


39 


447 


63. 


4 


4053 


6 


AX195249 


AX195249 Sequence 


40 


447 


63. 


4 


4053 


9 


AB020693 


AB020693 Homo sapi 


41 


447 


63. 


4 


4060 


9 


AY123250 


AY123250 Homo sapi 


42 


447 


63. 


4 


4063 


10 


AY102280 


AY102280 Mus muscu 


45 


447 


63. 


4 


4070 


9 


AY123249 


AY12324 9 Homo sapi 



44 447 63.4 4093 6 BD270070 

45 447 63.4 4102 9 AY123245 



BD270070 Secreted 
AY123245 Homo sapi 



ALIGNMENTS 



RESULT 1 

RN0242963 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



gene 
CDS 



RN0242963 1568 bp mRNA linear ROD 28-JAN-2000 

Rattus norvegicus mRNA for Nogo-C protein. 

AJ242963 

AJ242963.1 GI:6822250 
Nogo-C protein. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus. 
1 

Chen,M.S., Huber,A.B., van der Haar,M.E., Frank, M., Schnell,L., 
Spillmann, A. A. , Christ, F. and Schwab, M.E. 

Nogo-A is a myelin-associated neurite outgrowth inhibitor and an 

antigen for monoclonal antibody IN-1 

Nature 403 (6768), 434-439 (2000) 

20129258 

10667796 

2 (bases 1 to 1568) 
Van der Haar,M.E. 
Direct Submission 

Submitted ( 14- JUN-1999) Van der Haar M.E., Department . of 
Neuromorphology, Brain Research Institute, University of Zurich, 
Winterthurerstrasse 190, Zurich, CH-8057, SWITZERLAND 
Related sequences: AJ242961-2. 

Location/ Qualifiers 

1. .1568 

/organism="Rattus norvegicus" 

/mol_type= n mRNA n 

/db_xref="taxon: 10116" 

1. .1568 

/gene="nogo-C" 

118. .717 

/gene="nogo-C" 

/ f unction="unknowh" 

/note="The nogo gene encodes different transcripts. Nogo-A 

en -C have a unique 5' end but share the same 3 f end" 

/ codon_start=l 

/product— "Nogo-C protein" 

/protein_id="CAB71029. 1" 

/db_xref="GI : 6822251" 

/db_xref="GOA:Q9JKll" 

/db_xref ="UniProt/Swiss-Prot : Q9 JK11 " 

/translation="MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIV 
SWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 
LGHWSTIKELRRLFLVDDLVDSLKFAVljMWVFTYVGALFNGLTLLILALISLFS I PV 
I YERHQVQI DH YLGLANKS VKDAMAKI QAKI PGLKRKAD " 



ORIGIN 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 11:29:46 ; Search time 199.52 Seconds 

(without alignments) 
4183.459 Million cell updates/sec 

Title: US-09-830-972-32 
Perfect score: 705 

Sequence: 1 Q AS GEAGVS C L REN FAVYS V ESEVAISEELVQKYSNSALG 141 



Scoring table: 



BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 



Searched: 



4390206 seqs, 2959870667 residues 



Total number of hits satisfying chosen parameters: 



8780412 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094 842_18342/app_query. fasta_l 
.3683 

-DB=N_Geneseq -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN- 15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_1955_@runat_14042005_094842_18342 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP =6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : N_Geneseq_16Dec04 : * 

1: geneseqnl980s : * 

2 : geneseqnl990s : * 

3 : geneseqn2000s : * 

4 : geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002as : * 

7: geneseqn2002bs : * 

8 : geneseqn2003as : * 

9: geneseqn2003bs : * 
10 : geneseqn2003cs : * 



11 

12 
13 



geneseqn2003ds : * 
geneseqn2004as : * 
geneseqn2004bs : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



fo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


680.5 


96. 


5 


1568 


3 


AAD01175 


Aad01175 Rat neuri 


2 


581.5 


82 . 


5 


734 


12 


ADF42781 


Adf42781 Mouse CYP 


3 


572 . 5 


81. 


2 


1798 


6 


ABK90135 


Abk90135 DNA encod 


4 


571.5 


81. 


1 


668 


6 


ABL89601 


Abl89601 Human pol 


5 


571.5 


81. 


1 


1213 


2 


AAX04379 


Aax04379 Human sec 


6 


566 


80. 


3 


1785 


12 


ADK14166 


Adkl4166 Human aut 


7 


559 . 5 


79. 


4 


2610 


11 


ADI31056 


Adi31056 Human cDN 


8 


553.5 


78 . 


5 


770 


3 


AAA72983 


Aaa72983 Human NSP 


9 


531 


75. 


3 


431 


8 


ABX37040 


Abx37040 Bovine ES 


10 


531 


75. 


3 


799 


2 


AAV23695 


Aav23695 Human NSP 


11 


525 


74 . 


5 


991 


2 


AAX97587 


Aax97587 Extended 


12 


525 


74 . 


5 


994 


12 


ADP18854 


Adpl8854 Human sec 


13 


509. 5 


72 . 


3 


404 


2 


AAX41193 


Aax41193 Human sec 


14 


503 


71. 


3 


600 


4 


AAF90323 


Aaf90323 Human NOG 


15 


500 


70. 


9 


389 


8 


ABX39989 


Abx39989 Bovine ES 


16 


470.5 


66. 


7 


639 


6 


ABN96987 


Abn96987 Gene #348 


17 


455 


64 . 


5 


4822 


6 


ABS70449 


Abs70449 Human bon 


18 


447 


63 . 


4 


3492 


12 


ADP45571 


Adp45571 Rat NogoA 


19 


447 


63. 


4 


3579 


3 


AAZ56886 


Aaz56886 Human MAG 


20 


447 


63. 


4 


3579 


4 


AAF90324 


Aaf90324 Human NOG 


21 


447 


63. 


4 


3579 


6 


ABK90134 


Abk90134 DNA encod 


22 


447 


63 . 


4 


3579 


6 


ABN86601 


Abn86601 Human neu 


23 


447 ' 


63 . 


4 


3579 


12 


ADO07886 


Ado07886 Human pol 


24 


447 


63 . 


4 


3579 


12 


ADR13965 


Adrl3965 Human NOG 


25 


447 


63 . 


4 


3833 


3 


AAD01174 


Aad01174 Bovine ne 


26 


447 


63 . 


4 


3919 


12 


ADP45550 


Adp45550 Human Nog 


27 


447 


63. 


4 


4053 


4 


AAS09453 


Aas 09453 Human cDN 


28 


447 


63. 


4 


4053 


9 


ACC81048 


Acc8104 8 Human Nog 


29 


447 


63. 


4 


4053 


12 


ADP13574 


Adpl3574 Renal eel 


30 


447 


63. 


4 


4093 


3 


AAA23454 


Aaa23454 cDNA enco 


31 


447 


63. 


4 


4632 


6 


ABV94680 


Abv94680 Human pan 


32 


447 


63. 


4 


4632 


10 


ADG32772 


Adg32772 Human DNA 


33 


447 


63. 


4 


4684 


3 


AAD01173 


Aad01173 Rat neuri 


34 


447 


63. 


4 


4684 


6 


ABN86600 


Abn86600 Rat neuro 


35 


447 


63. 


4 


4698 


8 


ABX34563 


Abx34563 Human mdd 


36 


447 


63. 


4 


4789 


13 


ADR83534 


Adr83534 Human ret 


37 


443 


62. 


8 


1122 


3 


AAZ56888 


Aaz56888 Human MAG 


38 


443 


62. 


8 


1122 


4 


AAF90325 


Aaf90325 Human NOG 


39 


443 


62. 


8 


1216 


6 


ABA05903 


Aba 05 903 Human RTN 


40 


443 


62. 


8 


1599 


10 


ADI62860 


Adi62860 Human apo 


41 


443 


62. 


8 


1610 


3 


AAZ36230 


Aaz36230 cDNA enco 


42 


443 


62. 


8 


1683 


4 


AAD08386 


Aad08386 Human sec 


43 


443 


62. 


8 


1758 


4 


AAF32725 


Aaf32725 Human sec 


44 


443 


62. 


8 


2052 


6 


ABK90133 


Abk90133 DNA encod 



45 443 62.8 2235 6 ABV94681 Abv94681 Human pan 

ALIGNMENTS 

RESULT 1 
AAD01175 

ID AAD01175 standard; cDNA; 1568 BP. 
XX 

AC AAD01175; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo C cDNA. 
XX 

KW Rat; neurite growth inhibitor; Nogo C; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; ss. 

XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 
FT CDS 1. .1566 

FT /*tag= a 

FT /product= "Nogo C protein (residues 40-238) flanked by 1- 

FT 39 residues at the N-terminal and 239-522 residues at the 

FT C- terminal" 



FT 


/ transl 


except= 


(pos : 


7. .: 


3, aa:Xaa) 


FT 


/ transl 


except= 


(pos : 


85. 


.87, aa:Xaa) 


FT 


/ transl 


except= 


(pos : 


787. 


.789, aa 


:Xaa) 


FT 


/transl 


except= 


(pos : 


826. 


.828, aa 


:Xaa) 


FT 


/transl 


except= 


(pos : 


841. 


.843, aa 


:Xaa) 


FT 


/transl 


except= 


(pos : 


883. 


.885, aa 


:Xaa) 


FT 


/ transl 


except= 


(pos : 


889. 


.891, aa 


:Xaa). 


FT 


/ transl 


except= 


(pos : 


940. 


.942, aa 


:Xaa) 


FT 


/transl 


except= 


(pos : 


952. 


.954, aa 


:Xaa) 


FT 


/ transl 


except= 


(pos : 


1003 


. .1005, 


aa : Xaa) 


FT 


/transl_ 


except= 


(pos: 


1111 


. .1113, 


aa :Xaa) 


FT 


/transl_ 


except= 


(pos: 


1120 


. .1122, 


aa :Xaa) 


FT 


/transl 


except= 


(pos: 


1138 


. .1140, 


aa:Xaa) 


FT 


/transl 


except= 


(pos: 


1216 


. .1218, 


aa : Xaa) 


FT 


/transl_ 


except= 


(pos : 


1222 


. .1224, 


aa : Xaa) 


FT 


/transl 


except= 


(pos : 


1228 


. .1230, 


aa : Xaa) 


FT 


/transl_ 


except= 


(pos : 


1264 


. .1266, 


aa:Xaa) 


FT 


/transl_ 


except= 


(pos: 


1297 


. .1299, 


aa :Xaa) 


FT 


/transl_ 


except= 


(pos: 


1318 


. .1320, 


aa:Xaa) 


FT 


/transl_ 


except= 


(pos : 


1357 


. .1359, 


aa:Xaa) 


FT 


/transl_ 


_except= 


(pos: 


1393 


. .1395, 


aa:Xaa) 


FT 


/transl_ 


_except= 


(pos : 


1444 


. .1446, 


aa:Xaa) 


FT 


/ transl_ 


_except= 


(pos: 


1537 


. .1539, 


aarXaa) 



FT /note= "Xaa corresponds to in-frame stop codon; the CDS 

FT does not end in a stop codon" 

FT /partial 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16 f 2005, 11:35:57 ; Search time 61.5766 Seconds 

(without alignments) 
3746.799 Million cell updates/sec 

Title: US-09-830-972-32 
Perfect score: 705 

Sequence: 1 QAS G EAG VS C L REN FAVY S V ES EVAI S EELVQKYSNS ALG 141 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 1202784 seqs, 818138359 residues 

Total number of hits satisfying chosen parameters: 2405568 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14 042005_094 844_18372/app_query. fasta_l 
.3683 

-DB=Issued__Patents_NA -QFMT=fastap -SUFFIX=rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_255_@runat_14 042 005_094844_1 8372 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV__TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DEL0P=6 -DELEXT=7 

Database : Issued Patents NA: * 



/cgn2_6/ptodata/l/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/5B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/PCTUS_COMB. seq: 
/cgn2 6/ptodata/l/ina/backfilesl . seq: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 








Description 




1 


572.5 


81 . 


2 


1669 


4 


us- 


09- 


949- 


016-3253 


Sequence 


3253, Ap 




2 


559.5 


79 . 


4 


2610 


4 


us- 


09- 


023- 


655-382 


Sequence 


382, App 




3 


531 


75 . 


3 


799 


2 


us- 


08- 


700- 


607-2 


Sequence 


2, Appli 




4 


455 


64 . 


5 


4822 


3 


us- 


09- 


484- 


970B-106 


Sequence 


106, App 




5 


337 


47 . 


8 


2069 


4 


us- 


09- 


949- 


016-3309 


Sequence 


3309, Ap 




6 


337 


47 . 


8 


3202 


4 


us- 


09- 


949- 


016-1127 


Sequence 


1127, Ap 


c 


7 


335 


47 . 


5 


601 


4 


us- 


09- 


949- 


016-117588 ' 


Sequence 


117588, 


c 


8 


335 


47 . 


5 


601 


4 


us- 


09- 


949- 


016-117589 


Sequence 


117589, 




9 


335 


47 . 


5 


42075 


4 


us- 


09- 


949- 


016-14995 


Sequence 


14995, A 




10 


311 


44 . 


, 1 


3517 


4 


us- 


09- 


799- 


451-111 


Sequence 


111, App 




11 


309 


43. 


8 


1766 


3 


us- 


09- 


149- 


476-254 


Sequence 


254, App 




12 


309 


. 43. 


8 


2262 


4 


us- 


09- 


949- 


016-2988 


Sequence 


2988, Ap 




13 


309 


43. 


8 


2664 


3 


us- 


09- 


149- 


476-255 


Sequence 


255, App 




14 


305 


43. 


3 


1095 


2 


us- 


08- 


700- 


607-4 


Sequence 


4, Appli 




15 


297 


42. 


. 1 


443 


4 


us- 


09- 


513- 


999C-3784 


Sequence 


3784, Ap 




16 


270 


38 . 


.3 


261 


2 


us- 


08- 


700- 


607-9 


Sequence 


9, Appli 


c 


17 


268 


38 . 


, 0 


601 


4 


us- 


09- 


949- 


016-40169 


Sequence 


40169, A 


c 


18 


268 


38 . 


. 0 


601 


4 


us- 


09- 


949- 


016-119335 


Sequence 


119335, 




19 


268 


38. 


0 


135667 


4 


us- 


09- 


949- 


016-15051 


Sequence 


15051, A 




20 


268 


38. 


.0 


152486 


4 


us- 


09- 


949- 


016-12869 


Sequence 


12869, A 




21 


261 


37 . 


.0 


13906 


4 


us- 


09- 


949- 


016-14730 


Sequence 


14730, A 




22 


250 


35, 


, 5 


2014 


4 


us- 


09- 


270- 


767-13561 


Sequence 


13561, A 




23 


229. 5 


32 . 


, 6 


794 


3 


us- 


09- 


149- 


■476-102 


Sequence 


102, App 




24 


214 


30. 


,4 


2181 


4 


us- 


09- 


949- 


■016-1419 


Sequence 


1419, Ap 




25 


177 


25. 


, 1 


15661 


4 


us- 


09- 


949- 


■016-13161 


Sequence 


13161, A 


c 


26 


129 


18 , 


.3 


601 


4 


us- 


09- 


949- 


■016-117566 


Sequence 


117566, 


c 


27 


101 


14 . 


, 3 


601 


4 


us- 


09- 


949- 


016-48087 


Sequence 


48087, A 




28 


99 


14 . 


, 0 


425 


3 


us- 


08- 


905- 


223-178 


Sequence 


17 8, App 


c 


29 


86 


12 , 


.2 


3042 


4 


us- 


09- 


762- 


■724-1 


Sequence 


1, Appli 




30 


79.5 


11, 


.3 


80355 


4 


us- 


09- 


■949- 


016-12735 


Sequence 


12735, A 




31 


79.5 


11 . 


. 3 


80357 


A 


us- 


09- 


■949- 


016-13572 


Sequence 


13572, A 




32 


78 


11 , 


, 1 


2535 


3 


us- 


09- 


422- 


■936-46 


Sequence 


46, Appl 




33 


78 


11 , 


. 1 


2883 


3 


us- 


09- 


422- 


■936-44 


Sequence 


44, Appl 




34 


76.5 


10 , 


. 9 


1797 


2 


us- 


08- 


■853- 


659A-28 


Sequence 


28, Appl 




35 


76.5 


10. 


.9 


8967 


2 


us- 


08- 


853- 


•659A-6 


Sequence 


6, Appli 


c 


36 


76.5 


10. 


.9 


8967 


2 


us- 


08- 


853- 


•659A-9 


Sequence 


9, Appli 




37 


76.5 


10. 


.9 


8967 


2 


us- 


08- 


853- 


■659A-64 


Sequence 


64, Appl 


c 


38 


76.5 


10. 


.9 


8967 


2 


us- 


08- 


■853- 


■659A-67 


Sequence 


67, Appl 




39 


76.5 


10. 


.9 


24701 


2 


us- 


08- 


■853^ 


•659A-2 


Sequence 


2, Appli 


c 


40 


76.5 


10. 


.9 


24701 


2 


us- 


08- 


■853- 


•659A-3 


Sequence 


3, Appli 




41 


76.5 


10. 


.9 


24701 


2 


us- 


■08- 


■853- 


■659A-60 


Sequence 


60, Appl 


c 


42 


76.5 


10, 


.9 


24701 


2 


us- 


08- 


■853- 


■659A-61 


Sequence 


61, Appl 


c 


43 


76.5 


10, 


.9 


40429 


4 


us- 


•08- 


■311- 


■731A-125 


Sequence 


125, App 


c 


44 


75.5 


10, 


.1 


897 


4 


us- 


09- 


-248- 


■796A-3257 


Sequence 


3257, Ap 


c 


45 


75 


10, 


.6 


1426 


3 


us- 


09- 


-121- 


-425-4 


Sequence 


4, Appli 



ALIGNMENTS 

RESULT 1 

US-09-949-016-3253 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 22:42:49 ; Search time 216.627 Seconds 

(without alignments) 
3948.747 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-09-830-972-32 
705 

1 QAS GEAGVS C L REN FAVY S V ESEVAI SEELVQKYSNSALG 141 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7.0 
Delop 6.0 , Delext 7.0 



Searched: \ 



5622541 seqs, 3033355566 residues 



11245082 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094847_18 4 66/app_query. fasta_l 
.3683 

-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 
-TRANS=human4 0.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSI ZE=500 -MINLEN=0 
-MAXLEN=2000000000 -USER=US09830972_@CGN_l_l_2092_@runat_14042005_094847_184 66 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT-120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 
-FGAPOP- 6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : 



Published 
1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



/cgn2 
/cgn2 
/cgn2 
/cgn2_ 
/ cgn2 
/cgn2_i 
/cgn2 
/cgn2_i 
/cgn2_ 
/cgn2 



Applications_NA: * 

6/ptodata/2/pubpna/US07_PUBCOMB. seq:* 
6/ptodata/2/pubpna/PCT_NEW__PUB.seq:* 
'6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 
6/ptodata/2/pubpna/US06_PUBCOMB . seq: * 
6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 
6/ptodata/2/pubpna/PCTUS_PUBCOMB . seq : * 
6/ptodata/2/pubpna/US08_NEW_PUB.seq: * 
6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 
6/ptodata/2/pubpna/US09A_PUBCOMB. seq:* 
6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 



11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq: * 

12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB . seq : * 

13 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 

14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB. seq: * 

15 : /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB. seq: * 

16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:* 

17 : /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB. seq:* 

18 : /cgn2_6/ptodata/2/pubpna/US10F_PUBCOMB. seq: * 

19: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB. seq: * 

20: /cgn2_6/ptodata/2/pubpna/USll_NEW_PUB. seq: * 

21: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 

22 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB. seq: + 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



i\t; o U _L L, 
Mo 


o co r e 


Query 
Match 


JjCIIU L.I1 


DB 


ID 


P)pk o /-> -j -r\ -j - ■} OTI 

J_/ \^ O V J 1_ V-/ l_» _1_ 1 


1 
X 


572 . 5 


81. 


2 


1160 


14 


US-10-17 5- 523-156 


Spnupnpp 1 56 Add 

\J \A. ^ liu J- \s , ^*tr tr 


o 


S79 S 


81. 


2 


1 7QR 


1 8 




^ pnn f^n ce± 1 0 Add 1 


3 


571.5 


81. 


1 


668 


17 


US-10-264-237-163 


^ ecru pnrp 163- Add 


4 


566 


80. 


3 


1785 


17 


US-10-439-388-62 


Spnupnrp 62. AddI 


5 


559 . 5 


79. 


4 


2610 


17 


Us- 10- 64 1-64 3-3 8 2 


Sequence 382 , App 


6 


531 


75. 


3 


431 


9 


US- 09- 960-352-22 05 


Sequence 2205, Ap 


7 


531 


75. 


3 


799 


17 


Us- 10- 660- 94 6-2 


Sequence 2, Appli 


8 


525 


74 . 


5 


994 


11 


US-09-978-360A-110 


Sequence 110, App 


9 


500 


70. 


9 


389 


9 


US- 09- 9 60-352-51 5 4 


Sequence 5154, Ap 


10 


470.5 


66. 


7 


639 


9 


US-09-880-107-3484 


Sequence 3484, Ap 


11 


447 


63. 


4 


3579 


9 


US-09-789-386-1 


Sequence 1, Appli 


12 


447 


63. 


4 


3579 


9 


US-09-893-348-22 


Sequence 22, Appl 


13 


447 


63. 


4 


3579 


17 


US-10-267-502-212 


Sequence 212, App 


14 


447 


63. 


4 


3579 


18 


US-10-327-213-8 


Sequence 8, Appli 


15 


447 


63. 


4 


3579 


18 


US-10-466-258-8 


Sequence 8, Appli 


16 


447 


63. 


4 


3579 


18 


US-10-810-653-22 


Sequence 22, Appl 


17 


447 


63. 


4 


4053 


9 


US-09-758-140-5 


Sequence 5, Appli 


18 


447 


63. 


4 


4053 


9 


US-09-972-599A-5 


Sequence 5, Appli 


19 


447 


63. 


4 


4053 


18 


US-10-717-597-310 


Sequence 310, App 


20 


447 


63. 


4 


4632 


14 


US-10-060-036-53 


Sequence 53, Appl 


21 


447 


63. 


4 


4684 


9 


US-09-893-348-17 


Sequence 17, Appl 


22 


447 


63. 


4 


4684 


18 


US-10-810-653-17 


Sequence 17, Appl 


23 


443 


62. 


8 


1122 


9 


US-09-789-386-5 


Sequence 5, Appli 


24 


443 


62. 


8 


1610 


9 


US-09-765-205-5 


Sequence 5, Appli 


25 


443 


62. 


8 


2052 


18 


US-10-466-258-3 


Sequence 3, Appli 


26 


443 


62. 


8 


2235 


14 


US-10-060-036-54 


Sequence 54, Appl 


27 


443 


62. 


8 


2782 


15 


US-10-205-194-165 


Sequence 165, App 


28 


441 


62. 


6 


1980 


17 


US-10-220-891-22 


Sequence 22, Appl 


29 


436.5 


61. 


9 


3492 


17 


US-10-267-502-214 


Sequence 214, App 


30 


383 


54. 


3 


1514 


9 


US-09-823-245A-349 


Sequence 349, App 


31 


349 


49. 


5 


1473 


15 


US-10-205-194-128 


Sequence 128, App 


32 


349 


49. 


5 


1502 


15 


US-10-205-219-94 


Sequence 94, Appl 


33 


349 


49. 


5 


1520 


15 


US-10-084-817-333 


Sequence 333, App 


34 


340 


48. 


2 


422 


9 


US-09-960-352-11567 


Sequence 11567, A 


35 


337 


47. 


8 


2331 


17 


US-10-267-502-213 


Sequence 213, App 



36 


337 


47. 


8 


2343 


17 


US 


-10 


-267 


-502-215 


Sequence 


215, App 


37 


337 


47. 


8 


3202 


9 


US- 


09- 


954- 


456-210 


Sequence 


210, App 


38 


337 


47. 


8 


3202 


17 


US 


-10 


-172 


-118-386 


Sequence 


386, App 


39 


337 


47. 


8 


3202 


17 


us 


-10 


-342 


-887-386 


Sequence 


386, App 


40 


337 


47. 


8 


3202 


18 


us 


-10 


-723 


-860-1480 


Sequence 


1480, Ap 


41 


337 


47. 


8 


3202 


19 


us 


-10 


-843 


-641A-3237 


Sequence 


3237, Ap 


42 


337 


47. 


8 


3305 


18 


us 


-10 


-723 


-860-5926 


Sequence 


5926, Ap 


43 


335 


47. 


5 


60615 


18 


us 


-10 


-633 


-423-9 


Sequence 


9, Appli 


44 


335 


47. 


5 


60615 


18 


us 


-10 


-427 


-741-9 


Sequence 


9, Appli 


45 


311 


44. 


1 


3517 


17 


us 


-10 


-302 


-172-111 


Sequence 


111, App 



ALIGNMENTS 



RESULT 1 

US-10-175-523-156 

Sequence 156, Application US/10175523 
Publication No. US20030096264A1 
GENERAL INFORMATION: 
APPLICANT: Brockman, Jeffrey 
APPLICANT: Evans, David 
APPLICANT: Hook, Derek 
APPLICANT: Klimczak, Leszek 
APPLICANT: Laeng, Pascal 
APPLICANT: Palf reyman, Michael 
APPLICANT: Raj an, Prithi 

TITLE OF INVENTION: MULT I -PARAMETER HIGH THROUGHPUT SCREENING ASSAYS (MPHTS) 
FILE REFERENCE: 3235/ 1 J7 95-US3 
CURRENT APPLICATION NUMBER: US/ 10/ 175 , 523 
CURRENT FILING DATE: 2002-06-18 
PRIOR APPLICATION NUMBER: US 60/299,151 
PRIOR FILING DATE: 2001-06-18 
PRIOR APPLICATION NUMBER: US 60/317,828 
PRIOR FILING DATE: 2001-09-07 
PRIOR APPLICATION NUMBER: US 60/325,150 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US 60/333,047 
PRIOR FILING DATE: 2001-11-14 
PRIOR APPLICATION NUMBER: US 60/349,936 
PRIOR FILING DATE: 2002-01-18 
PRIOR APPLICATION NUMBER: US 60/361,834 
PRIOR FILING DATE: 2002-03-04 
NUMBER OF SEQ ID NOS : 197 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 156 
LENGTH: 1160 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-175-523-156 



Alignment Scores: 

Pred. No. : 2.07e-68 

Score: 572.50 

Percent Similarity: 90.00% 
Best Local Similarity: 87.86% 

Query Match: 81.21% 

DB: 14 



Length: 1160 

Matches: 123 

Conservative: 3 

Mismatches: 11 

Indels: 3 

Gaps : 1 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:34:27 



Search time 1246.52 Seconds 

(without alignments) 

4305.640 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-32 
705 

1 QAS GEAGVS C L REN FAVY S V 



ESEVAI SEELVQKYSNSALG 141 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



34239544 seqs, 19032134700 residues 



Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=f rame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO__spool_h/US09830972/runat_14042005_094 843_18360/app_query. fasta_l 
.3683 

-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi ~LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-0UTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_13747_@runat_14042005_094843_18360 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : EST:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 



estl: * 
est2 : * 
'htc:* 
est3:* 
est4 : * 
est5:* 
est6 : * 
gssl : * 
gss2 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


680, 


.5 


96 


.5 


751 


7 


CK357937 


CK357937 


AGENCOURT 


2 


651, 


.5 


92 


.4 


695 


7 


CV077130 


CV077130 


AGEN COURT 


3 


644, 


.5 


91 


.4 


791 


7 


CF977898 


CF977898 


F26A06_04 


4 


635, 


.5 


90 


.1 


718 


1 


AA986233 


AA986233 


uc73gl2.y 


5 


635, 


.5 


90 


.1 


813 


5 


BU848611 

J_J v U i U v i. J. 


BU848611 


AGENCOURT 


6 


635, 


.5 


90, 


.1 


921 


5 


BU503291 


BU503291 


AGENCOURT 


7 


627, 


.5 


89 


.0 


595 


7 


CK617525 


CK617525 


mk06dl0.y 


3 


627, 


.5 


89 


.0 


826 


7 


CK791443 


CK791443 


AGENCOURT 


Q 


622, 


.5 


88 


.3 


424 


5 


BY2 62542 


BY262542 


BY262542 


i n 

x \j 


618, 


.5 


87 


.7 


74 9 


4 




BG296048 


602393712 


r 11 

XX 


609, 


.5 


86 


.5 


1054 


7 


CR755232 


CR755232 


CR755232 


1 7 

X 


604, 


.5 


85 


.7 


857 


4 


RI872386 


BI872386 


603397035. 


1 ^ 


581, 


.5 


82 


.5 


799 


1 


AU080133 


AU080133 


AU080133 


1 4 


579 


.5 


82 


.2 


504 


2 


RF671520 


BF671520 


602151450 


1 S 


579, 


.5 


82 


.2 




9 
c. 


BF670363 


BF670363 


602150242 


X O 


579, 


.5 


82 


.2 


678 


2 


BF672234 


BF672234 


602150421 


1 7 

X f 


579, 


.5 


82 


.2 


740 


4 


RG698 8 39 


BG698839 


602703243 


1 ft 
X o 


579 


82 


.1 


1057 


7 


CR753971 


CR753971 


CR753971 - 


19 


578 


.5 


82 


.1 


624 


2 


BE278557 


BE278557 


601156079 




578 


.5 


82 


.1 


766 


2 


BF575611 

U J- — ' / J U 1 1 


BF575611 


602133008 


21 


578 


.5 


82 


.1 


794 


1 


AU080127 


AU080127 


AU080127 


22 


578 


82 


.0 


805 


7 


C0735185 


C0735185 


SlLE04clO 


23 


576 


.5 


81 


.8 


876 


2 


BF574724 


BF574724 


602134511 


24 


575 


.5 


81 


.6 


747 


4 


BG623462 


BG623462 


602648520 


25 


574 


.5 


81 


.5 


472 


2 


BE969918 


BE969918 


601680007 


26 


573 


.5 


81 


.3 


822 


2 


BF695222 


BF695222 


602080446 


27 


573 


.5 


81 


.3 


834 


2 


BF574870 


BF574870 


602132290 


28 


572 


.5 


81 


.2 


460 


6 


CB147989 


CB147989 


K-EST0204 


29 


572 


.5 


81 


.2 


494 


6 


CA843655 


CA843655 


ir50c08.y 


30 


572 


.5 


81 


.2 


502 


2 


BF576341 


BF576341 


602134222 


31 


572 


.5 


81 


.2 


533 


4 


BG402094 


BG402094 


602465626 


32 


572 


.5 


81 


.2 


571 


6 


CD723962 


CD723962 


oj28f02.y 


33 


572 


.♦5 


81 


.2 


573 


2 


BE018510 


BE018510 


bb81d09.y 


34 


572 


.5 


81 


.2 


588 


5 


BQ271824 


BQ271824 


ikl5a03.y 


35 


572 


.5 


81 


.2 


588 


6 


CB070370 


CB070370 


is36b08.y 


36 


572 


.5 


81 


.2 


598 


6 


CB153870 


CB153870 


K-EST0211 


37 


572 


.5 


81 


.2 


599 


6 


CB156990 


CB156990 


K-EST0216 


38 


572 


.5 


81 


.2 


631 


6 


CD701478 


CD701478 


EST18002 


39 


572 


.5 


81 


.2 


633 


4 


BG618709 


BG618709 


602645164 


40 


572 


.5 


81 


.2 


647 


5 


BU951067 


BU951067 


io75d08 . y 


41 


572 


.5 


81 


.2 


661 


4 


BG434277 


BG434277 


602506242 


42 


572 


.5 


81 


.2 


665 


7 


CK005520 


CK005520 


AGENCOURT 


43 


572 


.5 


81 


.2 


670 


4 


BG563079 


BG563079 


602581868 


44 


572 


.5 


81 


.2 


676 


4 


BI601268 


BI601268 


603245789 


45 


572 


.5 


81 


.2 


713 


4 


BG427864 


BG427864 


602501551 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:30:21 



Search time 14217.1 Seconds 

(without alignments) 

3963.787 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP 



VKDAMAKIQAKI PGLKRKAD 1163 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



4708233 seqs, 24227607955 residues 



Total number of hits satisfying chosen parameters: 



9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094 842_1834 8/app_query. fasta 
.3683 

-DB=GenEmbl -QFMT-fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 - MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_19954_@runat_14042005_094842_1834 8 -NCPU=6 -ICPU=3 
-NO_MMAP - LARGEQU ERY -NEG_SCORES=0 -WAIT -DSPBLOCK-100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



1 
2 
3 
4 
5 
6 
7 
8 
9 



10: 



gb__ba : * 
gb_htg : * 
gb_in : * 
gb_om: * 
gb_ov: * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 



11: gb_sts:* 
12: gb_sy:* 
1 3 : gb_un : * 
1 4 : gb_vi : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 

1 


C O A O 

bo4 o 


1 AA 

1UU . 


0 


O A O A 

o4 o y 


b 


AX / bbU4 b 


AX / bbl)4 b 


Sequence 


o 

z 


C O A O 

DO 4 o 


1U0 . 


A 
0 


o4yz 


b 


Lybz ybu / 


cyoz y bU / 


Sequence 


3 


5848 


T A A 

100 . 


0 


4684 


"1 A 

10 


RNOz4z9d1 


7V TO ytOQCI 

A J z 4 z y b 1 


Rattus no 


4 


b31z . 0 


A A 

y u . 


o 


/l C O T 

4bz / 


1 A 

1U 


AY lUzz o4 


AVI nooQ/i 
AY1UZZ84 


Mus muscu 


5 


5307 


90 . 


7 


3821 


10 


AY114152 


ji v1 1 /IT CO 

AY1 14 lbz 


Mus muscu 


6 


C 1 A O C 

5303 . o 


A A 

90 . 


■"7 


A C 1 O 

4blo 


1 A 

10 


BCUbbo / o 


bCUbbo /o 


Mus muscu 


1 


4 501 . o 


1 / . 


0 


yj A c o 

4Uo3 


T A 
10 


AYlUzzo U 


7iV1 AOOGA 

AY 1UZZ oU 


Mus muscu 


0 


A A A O C 

4 4 03 . b 


/ b . 


3 


O A 1 A 

3919 


b 


CQozy4 ob 


L.Q oZ y 4 0 b 


Sequence 


9 


4403.5 


75 . 


3 


4053 


6 


AXiybz4y 


Axiybz4 y 


Sequence 


10 


4403.5 


75 . 


3 


4053 


9 


7inAOA/"AO 

AB020693 


7VT5AOA(CAO 

ABOz Ud93 


Homo sapi 


11 


4403.5 


1 b . 


3 


4166 


9 


AB0404 dz 


7,nA / A A C**> 

A±JU4 U4 bz 


Homo sapi 


1 o 

lz 


A A A O C 

4 4 U J . b 


/ b . 




/ coo 
4 boz 


A 

y 


Ar 1 4 0 b 0 / 


Ar 1 4 0 D 6 I 


Homo sapi 


13 


A A A O C 

4 4U J . b 


/ b . 




A T O A 

4 / o y 


b 


rr\Q n /inn 
uyo / 4 Ul / 


Lyo / 4 U 1 / 


Sequence 


14 


4 403 . b 


/ b . 


3 


/I 1 O A 

4 / o y 


A 

y 


AY 1 U z Z / y 


7\V1 AOOTQ 

AY 1 U Z z f y 


Homo sapi 


15 


4398 . 5 


75 . 


o 

2 


3b /o 


D 


AX / DbObO 


AX / bbUbU 


Sequence 


Id 


a o a o t 

4 iy 0 . b 


/ b . 


o 
Z 


O R T A 

ob / y 


b 


bUz4y44b 


dDZ 4 y 4 4 b 


Protein s 


1 7 


>t o a o c 

4 398 . b 


/ b . 


o 

z 


O C T A 

3b /y 


y 


nbAzblooo 


7\ TORT OOQ 

A J Z b 1 O 0 O 


Homo sapi 


1 o 


/I 001 c 


1 A 

1 4 . 


a 

y 


/I AQI 

4uyo 


b 


dUZ / UU / u 


JdJJZ / UU / u 


S ecreteol 


i y 


4 ob U . b 


1 A 

I 4 . 


A 
4 


A ft 9 9 

4oZZ 


b 


AKZ Z U 0 D 0 


AKZ 6U0DJ 


C? /->#• 1 1 f-\ w /-i 

oc(juencc 


o n 
Z U 


4 JZ o . b 


1 "3 
/ O . 


A 

y 


oo lb 


1U 


nr* n "5 0 0 io 
dLUjZZ / Z 


Dpnio OlO 
C5UU OZ Z / Z 


i v iu s mu 5CU 


9 1 

Z 1 


o y o i 


b / . 


z 




Z 


AC 1 0 1 4 0 1 


API 1 Zl ^ 1 


KaLLUS no 


9 9 

zz 


o y o j. 


D / • 


z 


0 "5 ft ^ Zl 1 


o 


API OOOI C 




D -1 -|- -|- 1 1 c no 
naLLUb LLKJ 


9 "3 

Zo 


"5 ft 1A 

JO Ji 


Dj . 


D 


9 A 9 ^ 
Z ft Z O 


D 


L-^ D X fl O Z / 


POP 1 A ^01 


OCLJUClH-C 


0 A 
Z H. 


>3 / Z O • <J 


DO . 


n 
i 




Q 
_7 




a yi 9 ^9 4 ft 


il v/iLLt-' OCtkvX 


25 


3725.5 


63. 


1 


4102 


9 


AY123245 


AY123245 


Homo sapi 


26 


3720.5 


63. 


6 


3491 


9 


AF333336 


AF333336 


Homo sapi 


27 


3714 


63. 


5 


4123 


9 


AY123247 


AY123247 


Homo sapi 


28 


3711 


63. 


5 


4160 


9 


AY123246 


AY123246 


Homo sapi 


29 


3708.5 


63. 


4 


4070 


9 


AY123249 


AY123249 


Homo sapi 


30 


3700 


63. 


3 


4060 


9 


AY123250 


AY123250 


Homo sapi 


31 


3584 


61. 


3 


2248 


6 


CQ814526 


CQ814526 


Sequence 


32 


3575 


61. 


1 


2278 


6 


CQ814528 


CQ814528 


Sequence 


33 


3495.5 


59. 


8 


60615 


10 


AY102286 


AY102286 


Mus muscu 


34 


3495.5 


59. 


8 


166516 


2 


AC135510 


AC135510 


Mus muscu 


35 


3495.5 


59. 


8 


211357 


2 


AC113284 


AC113284 


Mus muscu 


36 


3495.5 


59. 


8 


212042 


' 10 


AL929371 


AL929371 


Mouse DNA 


37 


3447.5 


59. 


0 


2883 


9 


AF320999 


AF320999 


Homo sapi 


38 


3407. 5 


58. 


3 


2974 


6 


CQ716296 


CQ716296 


Sequence 


39 


3065 


52. 


4 


2958 


10 


BC032192 


BC032192 


Mus muscu 


40 


2737 .5 


46. 


8 


2481 


9 


AF063601 


AF063601 


Homo sapi 


41 


2737.5 


46. 


8 


39674 


9 


AC092461 


AC092461 


Homo sapi 


42 


2737.5 


46. 


8 


90756 


9 


AY102285 


AY102285 


Homo sapi 


43 


2737.5 


46. 


8 


162692 


. 2 


AC016171 


AC016171 


Homo sapi 



44 2632.5 45.0 2386 6 BD190738 BD190738 Secreted 

45 2632.5 45.0 2386 6 AX099401 AX099401 Sequence 



ALIGNMENTS 



RESULT 1 
AX766046 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



exon 
ORIGIN 



AX766046 
Sequence 1 
AX766046 
AX766046.1 



3489 bp 
from Patent WO03002602. 

GI:32260128 



DNA 



linear PAT 25-JUN-2003 



Rattus norvegicus (Norway rat ) 
Rat t us norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; 
Rattus . 
1 

Eisenbach-Schwartz,M. and Hauben,E. 

Nogo and nogo receptor derived peptides for t-cell mediated 
neuroprotection 

Patent: WO 03002602-A 1 09-JAN-2003; 

YEDA RESEARCH AND DEVELOPMENT Co. LTD. (IL) 

Location/Qualifiers 

1. .3489 

/organism="Rattus norvegicus" 
/mol_type="unas signed DNA" 
/db_xref="taxon: 10116" 
1. .3489 



Alignment Scores: 

Pred. No.: 3.79e-200 

Score: 5848.00 

Percent Similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 100.00% 

DB: 6 



Length: 3489 

Matches: 1163 

Conservative: 0 

Mismatches: 0 

Indels: 0 

Gaps: 0 



US-09-830-972-2 (1-1163) x AX766046 (1-3489) 



Qy 

Db 



1 MetGluAspIleAspGlnSerSerLeuValSerSerSerThrAspSerProProArgPro 20 

I I I I I I I I I I I I I I I I I I II I II I I II II I II I I I I II I I II I I I I I I I I I. I I I I I I II I 
1 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 60 



Qy 

Db 

Qy 

Db 



21 



61 



41 



ProProAlaPheLysTyrGlnPheValThrGluProGluAspGluGluAspGluGluGlu 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I 
C C GCC CGC CTT CAAGT AC CAGTT C GT GACGGAGC C C GAGGAC GAGGAG GAC GAGGAG GAG 



GluGluAspGluGluGluAspAspGluAspLeuGluGluLeuGluValLeuGluArgLys 

I I I I I I I I I I II II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
121 GAGGAGGAC GAGGAGGAGGAC GAC GAGGAC CT AGAG GAACT GGAGGT GCT GGAGAGGAAG 



40 



120 



60 



180 



Qy 

Db 



61 ProAlaAlaGlyLeuSerAlaAlaAlaValProProAlaAlaAlaAlaProLeuLeuAsp 80 
I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
181 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGAC 240 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 11:29:46 ; Search time 1645.69 Seconds 

(without alignments) 
4183.459 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKIPGLKRKAD 1163 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7.0 
Delop 6.0 , Delext 7.0 

4390206 seqs, 2959870667 residues 



Total number of hits satisfying chosen parameters: 8780412 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

* Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14 042005_094842_18342/app_query. fasta_l 
.3683 

-DB=N_Geneseq -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits - START =1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN-200 -THR_SCORE=pct -THR_MAX=100 -THR__MIN=0 -ALIGN=15 -MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US09830972_@CGN_l_l__1955_@runat_14042005_094 842_18342 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database : 



N_Geneseq_16Dec04 : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



geneseqnl980s : * 
geneseqnl990s : * 
geneseqn2000s : * 
geneseqn2001as : * 
geneseqn2001bs : * 
geneseqn2002as : * 
geneseqn2002bs : * 
geneseqn2003as : * 
geneseqn2003bs : * 
geneseqn2003cs : ^ 



11 

12 
13 



geneseqn2003ds : * 
geneseqn2004as : * 
geneseqn2004bs : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


5848 


100 . 


0 


3492 


12 


ADP45571 


Adp455/1 Rat NogoA 


2 


5848 


100 . 


0 


4684 


3 


AAD01173 


Aad01173 Rat neuri 


3 


5848 


100 . 


0 


4684 


6 


ABN86600 


Abn8 6600 Rat neuro 


4 


5307 


90 . 


7 


3492 


12 


ADO07888 


AdoO/ooo Mouse pol 


5 


4403. 


5 


75 . 


3 


3919 


12 


ADP45550 


Adp4b5bU Human Nog 


6 


4403, 


5 


75 . 


3 


4053 


4 


AAS09453 


Aas 09453 Human cDN 


7 


4403 


5 


75 . 


3 


4053 


9 


ACC81048 


Acc81048 Human Nog 


8 


4403 


5 


75 . 


3 


4053 


12 


ADP13574 


Adpl3574 Renal eel 


9 


4403 


5 


75. 


3 


4632 


6 


ABV9 4 680 


Abv94680 Human pan 


10 


4403 


5 


75. 


3 


4632 


10 


ADG32772 


Adg327 72 Human DNA 


11 


4403 


5 


75 . 


3 


4789 


13 


ADR83534 


Adr83534 Human ret 


12 


4398 


5 


75. 


2 


3579 


3 


AAZ56886 


Aaz56886 Human MAG 


13 


4398 


5 


75 . 


2 


3579 


4 


AAF90324 


Aaf90324 Human NOG 


14 


4398 


5 


75 . 


2 


3579 


6 


ABK90134 


Abk90134 DNA encod 


15 


4398 


5 


. 75 . 


2 


3579 


6 


ABN86601 


Abn86601 Human neu 


16 


4398 


5 


75. 


2 


3579 


12 


ADO07886 


Ado07886 Human pol 


17 


4398 


5 


75. 


2 


3579 


12 


ADR13965 


Adrl3965 Human NOG 


18 


4381 


5 


74 . 


9 


4093 


3 


AAA23454 


7\^^00/1 ["/I _ T""\\T 7\ _ „ _ _. 

Aaa234 54 cDNA enco 


19 


4350 


.5 


74 . 


4 


4822 


6 


ABS70449 


Abs /044y Human Don 


20 


4329 


74 . 


0 


4698 


8 


ABX34563 


Adxo4odo Human mad 


21 


3834 


65 . 


6 


2425 


12 


AD02 6412 • 


Aao^o4±z Hat trunc 


22 


3584 


61 . 


3 


2248 


12 


AD026411 


Aaozo4ll Kat trunc 


23 


3575 


61 . 


1 


O O 1 Q 

22 to 


12 


AD02 o4 1 3 


acio^d41o Kau trunc 


O A 

24 


3495 


.5 


59 . 


o 
o 




13 




AutoyjoD lYius muscu 


25 


3413 


DO . 


4 


O O 1 o 


3 


"A 7\ T>n 1 1 "7 /I 


AaauJ.i/4 tsovine ne 


26 


2632 


.5 


45 . 


0 


2386 


2 


AAV3092 0 


AavJ0 92U Human sec 


27 


2623 


.5 


44. 


9 


2386 


5 


AAF98399 


Aaf98399 Human cDN 


28 


1975 


33. 


8 


1980 


4 


AAI98079 


Aai98 07 9 Human neu 


29 


1513 


25. 


9 


2782 


10 


ADB85284 


Adb85284 Rat fooce 


30 


1191 


20. 


4 


1122 


3 


AAZ56888 


Aaz56888 Human. MAG 


31 


1191 


20. 


4 


1122 


4 


AAF90325 


Aaf90325 Human NOG 


32 


1191 


20. 


4 


1216 


6 


ABA05903 


Aba05903 Human RTN 


33 


1191 


20. 


4 


1599 


10 


ADI62860 


Adi62860 Human apo 


34 


1191 


20. 


4 


1610 


3 


AAZ36230 


Aaz36230 cDNA enco 


35 


1191 


20. 


4 


2235 


6 


ABV94681 


Abv94681 Human pan 


36 


1191 


20. 


4 


2240 


3 


AAC64406 ■ 


Aac64 406 Human Nog 


37 


11 


33 


20. 


2 


2052 


6 


ABK90133 


Abk90133 DNA encod 


38 


1174 


20. 


1 


1694 


4 


AAK94408 


Aak94408 Human ful 


39 


1174 


20 


1 


1694 


12 


ADL31137 


Adl31137 Full leng 


40 


1035 


.5 


17 


7 


1683 


4 


AAD08386 


Aad08386 Human sec 


41 


925 


15 


8 


1568 


3 


AAD01175 


Aad01175 Rat neuri 


42 


917 


15 


7 


734 


12 


ADF42781 


Adf42781 Mouse CYP 


43 


916 


.5 


15 


7 


1758 


4 


AAF32725 


Aaf32725 Human sec 


44 


911 


.5 


15 


6 


2100 


8 


ABT42936 


Abt42936 Human neu 



45 908 ' 15.5 600 4 AAF90323 



Aaf90323 Human NOG 



ALIGNMENTS 

RESULT 1 
ADP45571 

ID ADP45571 standard; cDNA; 3492 BP. 
XX 

AC ADP45571; 
XX 

DT 09-SEP-2004 (first entry) 
XX 

DE Rat NogoA encoding cDNA SEQ ID NO: 25. 
XX 

KW binding molecule; human; NogoA; NiG; NiG-D20; NogoA_623-640 ; 
KW nerve repair; neuroprotective; gene therapy; 

KW central nervous system injury; CNS injury; neurodegenerative disorder; 

KW rat; gene; ss. 

XX 

OS Rattus norvegicus. 
XX 

FH Key Location/Qualif iers 

FT CDS 1. .3492 

FT /*tag= a 

FT /product= "NogoA" 

XX 

PN WO2004052932-A2 . 
XX 

PD 24-JUN-2004. 
XX 

PF 09-DEC-2003; 2003WO-EP013960 . 
XX 

PR 10-DEC-2002; 2002GB-00028832 . 
XX 

PA (NOVS ) NOVARTIS AG. 

PA (NOVS ) NOVARTIS PHARMA GMBH. 

PA (UYZU-) UNIV ZUERICH. 

XX 

PI Barske C, Mir AK, Oertle T, Schnell L, Schwab ME, Vitaliti A; 

PI Zurini M; 

XX 

DR- WPI; 2004-468818/44. 
DR P-PSDB; ADP45572. 
XX 

PT New binding molecule that binds to the human NogoA polypeptide, NiG, NiG 
PT D20 or NogoA623-640, useful, in preparing a composition for treating CNS 
PT injury or neurodegenerative disorders. 
XX 

PS Disclosure; SEQ ID NO 25; 121pp; English. 
XX 

CC The present invention describes a binding molecule which binds to human 
CC NogoA polypeptide, human NiG, human N1G-D20 or human NogoA_623-640 with 
CC dissociation constant of less than lOOOnM. Also described: (1) a 
CC polynucleotide encoding the binding molecule; (2) an expression vector o 
CC system comprising the polynucleotide; (3) a host cell comprising the 
CC expression system; (4) a pharmaceutical composition comprising the 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein 
Run on: 



- nucleic search, using f rame_plus_p2n model 

April 16, 2005, 11:35:57 ; Search time 507.898 Seconds 

(without alignments) 
3746.799 Million cell updates/sec 



Title: US-09-830-972-2 
Perfect score: 5848 
Sequence : 



1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKIPGLKRKAD 1163 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 



Searched: 



1202784 seqs, 818138359 residues 



2405568 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100.% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094844_18372/app_query.fasta_l 
.3683 

-DB=Issued Patents_NA -QFMT=fastap -SUFFIX=rni -MINMAXCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0~-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MO DE= LOCAL -OUTB^=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_255_@runat_14042005_094844_18372 -NCPU=6 -ICPU=3 
-NO MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP-6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : Issued_Patents_NA: * 

1: / C gn2_6/ptodata/l/ina/5A_COMB.seq: + 

2: /cgn2_6/ptodata/l/ina/5B_COMB.seq:* 

3 : /cgn2_6/ptodata/ 1/ina/ 6A_COMB . seq : * 

4: /cgn2_6/ptodata/l/ina/6B_COMB.seq:* 

5 : / C gn2_6/ptodata/l/ina/PCTUS_COMB. seq: * 

6 : /cgn2_6/ptodata/l/ina/backf ilesl . seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



No. Score Match Length DB ID Description 



1 


4350.5 


74. 


4 


4822 


3 


US-09-4 84-970B-106 


Sequence 


1 0 D , App 


2 


908 


15. 


5 


799 


2 


US-08-700-607-2 


Sequence 


2, Appli 


3 


908 


15. 


5 


1669 


4 


US-09-949-016-3253 


Sequence 


3253, Ap 


4 


895 


15. 


3 


2610 


4 


US-09-023-655-382 


Sequence 


382, App 


5 


797.5 


13. 


6 


3202 


4 


US-09-949-016-1127 


Sequence 


1127, Ap 


6 


741.5 


12. 


7 


3517 


4 


US-09-799-451-111 


Sequence 


111, App 


7 


724.5 


12. 


4 


2069 


4 


US-09-949-016-3309 


Sequence 


3309, Ap 


8 


625.5 


10. 


7 


1766 


3 


US-09-149-476-254 


Sequence 


254, App 


9 


625.5 


10. 


7 


2262 


4 


US-09-949-016-2988 


Sequence 


r\ AAA Ti 

2988, Ap 


10 


625.5 


10. 


7 


2664 


3 


US-09-149-476-255 


Sequence 


255, App 


11 


527.5 


9. 


0 


1095 


2 


US-08-700-607-4 


Sequence 


4, Appli 


12 


522.5 


8. 


9 


2014 


4 


US-09-270-767-13561 


Sequence 


13561, A 


13 


518.5 


8. 


9 


794 


3 


US-09-149-476-102 


Sequence 


102, App 


14 


490 


8. 


,4 


2181 


4 


US-09-949-016-1419 


Sequence 


1419, Ap 


15 


444.5 


7. 


,6 


428 


4 


US-09-513-999C-923 


Sequence 


923, App 


16 


345 


5. 


,9 


382 


4 


US-09-513-999C-11526 


Sequence 


11526, A 


17 


341 


5, 


.8 


601 


4 


US-09-94 9-016-117588 


Sequence 


117588, 


18 


341 


5. 


.8 


601 


4 


US-09-94 9-016-117 589 


Sequence 


117589, 


19 


341 


5. 


.8 


42075 


4 


US-09-949-016-14995 


Sequence 


14995, A 


20 


294.5 


5, 


.0 


145287 


4 


US-09-94 9-016-13530 


Sequence 


13530, A 


21 


294.5 


5. 


.0 


145287 


4 


US-09-94 9-016-13531 


Sequence 


13531, A 


22 


293 


5, 


;o 


443 


4 


US-09-513-999C-3784 


Sequence 


3784, Ap 


23 


288 


4, 


.9 


7571 


4 


US-09-94 9-016-4 366 


Sequence 


4366, Ap 


24 


288 


4, 


.9 


8146 


4 


US-09-976-594-725 


Sequence 


725, App 


25 


288 


4 


.9 


11917 


4 


US-09-566-921-32 


Sequence 


32, Appl 


26 


288 


4 


.9 


101300 


4 


US-09-949-016-16108 


Sequence 


16108, A 


27 


286 


4 


.9 


441 


4 


US-09-513-999O2227 


Sequence 


2227, Ap 


28 


285 


4 


.9 


135667 


4 


US-09-94 9-016-15051 


Sequence 


15051, A 


29 


285 


4 


. 9 


152486 


4 


US-09-94 9-016-12 8 69 


Sequence 


12869, A 


30 


282.5 


4 


.8 


4148 


4 


US-09-344-624-2 


Sequence 


2, Appli 


31 


279.5 


4 


.8 


601 


4 


US-09-949-016-40169 


Sequence 


40169, A 


32 


279.5 


4 


.8 


601 


4 


US-09-94 9-016-119335 


Sequence 


119335, 


33 


279.5 


4 


.8 


5361 


3 


US-08-973-462-2 


Sequence 


2, Appli 


34 


279.5 


4 


.8 


6152 


3 


US-08-973-462-1 


Sequence 


1, Appli 


35 


276 


4 


.7 


261 


2 


US-08-700-607-9 


Sequence 


9, Appli 


36 


275.5 


4 


.7 


8560 


4 


US-09-949-016-1788 


Sequence 


1788, Ap 


37 


275.5 


4 


.7 


8560 


4 


US-09-949-016-1789 


Sequence 


1789, Ap 


38 


271 


4 


.6 


13906 


4 


US-09-949-016-14730 


Sequence 


14730, A 


39 


267 


4 


.6 


6608 


4 


US-09-220-132-58 


Sequence 


58, Appl 


40 


265.5 


4 


.5 


6605 


1 


US-08-769-309A-4 


Sequence 


4, Appli 


41 


265.5 


4 


.5 


6605 


3 


US-08-994-570-4 


Sequence 


4, Appli 


42 


265 


4 


.5 


3561 


3 


US-09-134-001C-1685 


Sequence 


1685, Ap 


43 


265 


4 


.5 


6414 


3 


US-09-134-001C-1626 


Sequence 


1626, Ap 


44 


263.5 


4 


.5 


4484 


4 


US-09-949-016-4498 


Sequence 


4498, Ap 


45 


257 


4 


.4 


2835 


3 


US-09-134-001C-1515 


Sequence 


1515, Ap 



ALIGNMENTS 



RESULT 1 

US-09-484-970B-106 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 22:42:49 ; 



Search time 1786.79 Seconds 

(without alignments) 

3948.747 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP 



VKDAMAKIQAKI PGLKRKAD 1163 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



5622541 seqs, 3033355566 residues 



Total number of hits satisfying chosen parameters: 11245082 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL-frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool__h/US09830972/runat_14042005_094847_18466/app_query.fasta_l 
.3683 

-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits - START =1 -END=-1 -MATRIX=blosum62 
-TRANS=human40.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR MIN=0 -ALIGN=15 -MODE-LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 
-MAXLEN=2000000000 -USER=US09830972 J3CGN_l_l_2092_@runat JL4 042005_094 847_184 66 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 
-FGAPOP- 6 -FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:* 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq: * 

3: /cgn2_6/ptodata/2/pubpna/US06__NEW_PUB.seq:* 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq: * 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:* 

7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:* 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:* 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 

10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 



Database 



Published_Applications_NA: * 



11: / C gn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq: 

12: /cgn2_6/ptodata/2/pubpna/US09JJEW_PUB.seq:* 

13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq: 

14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq: 

15 : / cgn2_6/p todat a / 2 /pubpna/US 1 0C_PUBCOMB . s eq : 

16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq: 

17: /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq: 

18: /cgn2_6/ptodata/2/pubpna/US10F_PUBCOMB.seq: 

19: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:* 

20: /cgn2_6/ptodata/2/pubpna/USll_NEW_PUB.seq:* 

21: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:* 

22: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:* 

Pred. No, is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Pocnl +" 
In \J * 


S core 


Query 
Match : 


Length 


DB 


ID 


1 

-L 


5848 


100.0 


4684 


9 


US-09-893-348-17 


9 


5848 


100. 0 


4684 


18 


US-10-810-653-17 


3 


5307 


90.7 


3492 


17 


US-10-267-502-214 


4 


4403.5 


75.3 


4053 


9 


US-09-758-140-5 


5 


4403.5 


75.3 


4053 


9 


US-09-972-599A-5 


6 


4403 . 5 


75.3 


4053 


18 


US-10-717-597-310 


7 


4403 . 5 


75.3 


4632 


14 


US-10-060-036-53 


8 


4398 . 5 


75.2 


3579 


9 


US-09-789-386-1 


9 


4398 . 5 


75.2 


3579 


9 


US-09-893-348-22 


10 


4398.5 


75.2 


3579 


17 


US-10-267-502-212 


11 


4398.5 


75.2 


3579 


18 


US-10-327-213-8 


12 


4398.5 


75.2 


3579 


18 


US-10-466-258-8 


13 


4398.5 


75.2 


3579 


18 


US-10-810-653-22 


14 


3495.5 


59.8 


60615 


18 


US-10-633-423-9 


15 


3495.5 


59.8 


60615 


18 


US-10-427-741-9 


16 


1975 


33.8 


1980 


17 


US-10-220-891-22 


17 


1513 


25.9 


2782 


15 


US-10-205-194-165 


18 


1191 


20.4 


1122 


9 


US-09-789-386-5 


19 


1191 


20.4 


1610 


9 


US-09-765-205-5 


20 


1191 


20.4 


2235 


14 


US-10-060-036-54 


21 


1183 


20.2 


2052 


18 


US-10-466-258-3 


22 


911.5 


15.6 


2100 


19 


US-10-479-081-217 


23 


908 


15.5 


799 


17 


US-10-660-946-2 


24 


908 


15.5 


1160 


14 


US-10-175-523-156 


25 


908 


15.5 


1785 


17 


US-10-439-388-62 


26 


899 


15.4 


994 


11 


US-09-978-360A-110 


27 


895 


15.3 


2610 


17 


US-10-641-643-382 


28 


888 


15.2 


868 


9 


US-09-789-386-3 


29 


868 


14.8 


1798 


18 


US-10-466-258-10 


30 


844 


14.4 


1514 


9 


US-09-823-245A-349 


31 


797.5 


13.6 


3202 


9 


US-09-954-456-210 


32 


797.5 


13.6 


3202 


17 


US-10-172-118-386 


33 


797.5 


13.6 


3202 


17 


US-10-342-887-386 


34 


797.5 


13.6 


3202 


18 


US-10-723-860-1480 


35 


797.5 


13.6 


3202 


19 


US-10-843-641A-3237 



Description 



Sequence 17, Appl 
Sequence 17, Appl 
Sequence 214, App 
Sequence 5, Appli 
Sequence 5, Appli 
Sequence 310, App 
Sequence 53, Appl 
Sequence 1, Appli 
Sequence 22, Appl 
Sequence 212, App 
Sequence 8, Appli 
Sequence 8, Appli 
Sequence 22, Appl 
Sequence 9, Appli 
Sequence 9, Appli 
Sequence 22, Appl 
Sequence 165, App 
Sequence 5, Appli 
Sequence 5, Appli 
■ Sequence 54 , Appl 
Sequence 3, Appli 
Sequence 217, App 
Sequence 2, Appli 
Sequence 156, App 
Sequence 62, Appl- 
Sequence 110, App 
Sequence 382, App 
Sequence 3, Appli 

Sequence 10 , Appl 
Sequence 349, App 
Sequence 210, App 
Sequence 38 6, App 
Sequence 38 6, App 
Sequence 14 80, Ap 
Sequence 3237, Ap 



36 


797.5 


13. 


6 


3305 


18 


US 


-10 


-723 


-860-5926 


Sequence 


5926, Ap 


37 


792 


13 


5 


2343 


17 


US 


-10 


-267 


-502-215 


Sequence 


Old 


App 


38 


789.5 


13 


5 


2331 


17 


US 


-10 


-267 


-502-213 


Sequence 




App 


39 


741.5 


12 


7 


3517 


17 


US 


-10 


-302 


-172-111 


Sequence 


111 
111, 


App 


40 


713 


12 


2 


3637 


17 


US 


-10 


i n o 
- lUo 






449, 


App 


41 


713 


12 


2 


3637 


17 


us 


-10 


-159 


-563-443 


Sequence 


443, 


App 


42 


686.5 


11 


7 


1502 


15 


us 


-10 


-205 


-219-94 


Sequence 


94, 


Appl 


43 


686 


11 


.7 


1520 


15 


us 


-10 


-084 


-817-333 


Sequence 


333, 


App 


44 


675 


11 


.5 


422 


9 


us- 


09- 


960- 


352-8477 


Sequence 


8477, 


Ap 


45 


671 


11 


.5 


1473 


15 


us 


-10 


-205 


-194-128 


Sequence 


128, 


App 



ALIGNMENTS 



RESULT 1 

US-09-893-348-17 

Sequence 17, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SEN BACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A <D m J_ fj i 

CURRENT APPLICATION NUMBER: US/09/893,348 \^6n% cZfr*T 

CURRENT FILING DATE: 2001-06-28 J r . J # 

PRIOR APPLICATION NUMBER: US 09/314,161 ^ >^^C 

PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/ 147 15 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 17 
LENGTH: 4 684 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE: 
NAME/ KEY: CDS 
LOCATION: (253) . ■ (3744) 
OTHER INFORMATION: 
US-09-893-348-17 



Alignment Scores: 

Pred. No. : .0 

Score: 5848.00 

Percent Similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 100.00% 

DB: 9 



Length: 4684 

Matches: 1163 

Conservative: 0 

Mismatches : 0 

Indels: 0 

Gaps : 0 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:34:27 



Search time 10281.6 Seconds 

(without alignments) 

4305.640 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP 



VKDAMAKIQAKI PGLKRKAD 1163 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



34239544 seqs, 19032134700 residues 



Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094 843_18360/app_query. fasta_l 
.3683 

-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 - MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000 000000 
-USER=US09830972_@CGN_l_l_13747_@runat_14042005_094 843_18360 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DEL0P=6 -DELEXT=7 

Database : EST:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 



estl:* 
est2:* 
htc:* 
est3 : * 
est4 : * 
est5 : * 
est6 : * 
gssl : * 
gss2 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



WO . 
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Match 


Length 
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Description 


1 


1290 


22. 


1 


r\ s~ r\ 

969 


5 


OTT O O A A A /I 

BU 8 39934 


BU839934 


AG EN COURT 


2 


i o o a c 


22. 


0 


Q CO O 

3533 


o 


AKUo4y U^: 


AK034902 


Mus muscu 


o 


1Z / 1 


21. 


7 


/ O 0 


D 


LAj 1 1 o / U 


CA511870 


UI-R-FJO- 


A 

4 


1 O RQ C 


21. 


5 


by d 


b 


LdZ U 4 4 1 o 


CB204418 


AGENCOURT 


5 


1 o c c 

1255 


21. 


5 


O A O 

842 


5 


nitn A A 1 /(A 

BU / 0 9 1 4 9 


BU709149 


UI-M-EWO- 


6 


1 A A A C 

1220 . 5 


20. 


9 


n O /C 

986 


5 


OTTO ^ T AAA 

BU 8 410 0 9 


BU841009 


AGENCOURT 


1 


1218 . 5 


20. 


8 


805 


4 


oTTom ao 
BI /30192 


BI730192 


603349739 


8 


n A i c 

1215 


20. 


8 


796 


6 


/"•7VCA/IOOA 

CAb 04/29 


CA504729 


UI-R-FJ0- 


9 


1198 


20. 


5 


o o a 

822 


6 


/-inCOl OOO 

CB521332 


CB521332 


UI-M-GH0- 


10 


1180 . 5 


20. 


2 


803 


7 


CN539361 


CN539361 


UI-M-HU0- 


11 


1163 


19. 


9 


*7 O O 

111 


6 


CA320618 


CA320618 


UI-M-FW0- 


12 


1131 


19. 


3 


778 


5 


BU709106 


BU709106 


UI-M-EW0- 


13 


1124 . 5 


19. 


2 


O O A 

772 


7 


CF94 85o 8 


CF948588 


UI-M-HJ0- 


14 


1122 . 5 


19. 


2 


AAA 

802 


6 


y*7\OAA^OC 

CA3 2 0635 


CA320635 


UI-M-FW0- 


15 


1121 


19. 


2 


756 


7 


CK635939 


CK635939 


UI-M-HN0- 


16 


1114 . 5 


19. 


1 


951 


5 


BQ892001 


BQ892001 


AGENCOURT 


17 


1110 . 5 


19. 


0 


1024 


7 


/~*XJ f~ A A O A O 

CN64384 / 


CN643847 


ILLUMIGEN 


18 


1107 


18. 


9 


1013 


7 


CN 64 6105 


CN646105 


ILLUMIGEN 


19 


1097 


18. 


8 


638 


6 


CB576696 


CB576696 


AMGNNUC : C 


20 


1094 


18. 


7 


862 


1 


AU079375 


AU079375 


AU079375 


21 


1084 . 5 


18. 


5 


739 


5 


nTT/*1 ri a r 1 

BU612951 


BU612951 


UI-M-FR0- 


22 


1080 . 5 


18. 


5 


767 


7 


CF /2 683b 


CF726835 


UI-M-HB0- 


23 


1080 


18. 


5 


714 


/ 


/-i\T C O A O O A 

CN532333 


CN532333 


UI-M-HQ0- 


24 


1077 . 5 


18. 


4 


O >! O 
/4 / 


/ 


r+r\ A A /I 1 O O 

C042413 / 


C0424137 


UI-M-HU0- 


25 


1074 


18. 


4 


726 


7 


/""T^TATOAA 

CF /3 7320 


CF737320 


UI-M-HD0- 


26 


1068 . 5 


18. 


3 


935 


5 


r>/"\ A /"" O A C O 

BQ9630b / 


BQ963057 


AGENCOURT 


27 


1063 


18. 


2 


O A /I 

/24 


/ 


/"» TT' r 7 A A C A O 

CF /29bU3 


CF729503 


UI-M-HD0- 


28 


1062 


18. 


2 


OTA 

/19 


/ 


/-*»t c A O A C A 


CN528062 


UI-M-HQ0- 


29 


1051 . 5 


18. 


0 


O >1 A 

742 


6 


/-*7\OAAOOO 

CA3 2 08 33 


CA320833 


UI-M-FW0- 


A A 

30 


1 ATI C 

1051 . 5 


18. 


0 


O A A 

844 


6 


rrjo /I C1 AC 

CB84 blOb 


CB845105 


M2PN-067 5 


O 1 

31 


1 A A O C 

1043 . 5 


17. 


8 


/ 14 


/ 


CN4 b61 / o 


CN456178 


UI-M-HN0- 


o o ■ 
32 


1U43 


17. 


8 


o n n 
/ UU 


A 

4 


DTCCd 1 OA 

bl bb4 1 / y 


BI664179 


603289106 


O "3 

33 


1 A A ~\ 

1041 


17. 


8 


O O O 
/ 13 


b 


pnA/| /lOAO 

CBZ44 /UZ 


CB244702 


UI-M-FY0- 


34 


1039 


17. 


8 


688 


7 


CF540092 


CF540092 


UI-M-EX0- 


35 


1023 


17. 


5 


673 


6 


CD349457 


CD349457 


UI-M-FY0- 


36 


1021.5 


17. 


5 


710 


7 


CN458728 


CN458728 


UI-M-HB0- 


37 


1017 


17. 


4 


871 


5 


BQ719894 


BQ719894 


AGENCOURT 


38 


1012 


17. 


3 


779 


7 


CO401465 


CO401465 


AGENCOURT 


39 


1004 


17. 


2 


751 


6 


CA315995 


CA315995 


UI-M-FW0- 


40 


1001.5 


17. 


1 


691 


5 


BU707644 


BU707644 


UI-M-FR0- 


41 


1000.5 


17. 


1 


782 


4 


BI739239 


BI739239 


603359521 


42 


997 


17. 


0 


609 


6 


CB580803 


CB580803 


AMGNNUC :N 


43 


995 


17. 


0 


651 


7 


CK619609 


CK619609 


mk26bl2.y 


44 


983 


16. 


8 


730 


4 


BI601346 


BI601346 


603245090 


45 


977.5 


16. 


7 


683 


7 


CN526355 


CN526355 


UI-M-HN0- 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 14, 2005 f 17:00:28 



Search time 5390.01 Seconds 

(without alignments) 

3236.333 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



SEQ2 
1823 

1 MEDI'DQSSLVSSSTDSPPRP 



VKDAMAKIQAKI PGLKRKAD 360 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



4708233 seqs, 24227607955 residues 



Total number of hits satisfying chosen parameters: 



9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model ~DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104546_4009/app_query . f asta_l . 1038 

-DB=GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_6287_@runat_14042005_104546_4009 
~NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 - WARN_T IMEOUT= 3 0 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb_ba : * 
gb_htg: * 
gb_in: * 
gb_om: * 
gb_ov: * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 



10: 
11: 



gb_sts : * 



12: gb_sy:* 
1 3 : gb_un : * 
1 4 : gb_vi : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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ALIGNMENTS 

RESULT 1 
AY164740 

LOCUS AY164740 2156 bp mRNA linear ROD 30-JUN-2003 

DEFINITION Rattus norvegicus RTN4-B1 (Rtn4) mRNA, complete cds; alternatively 

spliced. 
ACCESSION AY164740 

VERSION AY164740.1 GI:32331280 

KEYWORDS 

SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

REFERENCE 1 (bases 1 to 2156) 

AUTHORS Oertle,T., Klinger,M., Stuermer, C. A. and Schwab, M.E. 

TITLE A reticular rhapsody: phylogenic evolution and nomenclature of the 

RTN/Nogo gene family 
JOURNAL FASEB J. 17 (10), 1238-1247 (2003) 
PUBMED 128 32288 
REFERENCE 2 (bases 1 to 2156) 

AUTHORS Oertle,T. and Schwab, M.E. 
TITLE Direct Submission 

JOURNAL Submitted ( 16-OCT-2002 ) Brain Research Institute and Department 

Biology, University of Zurich and ETH Zurich, Winterthurerstr . 190, 
Zurich 8057, Switzerland 
FEATURES Location/Qualifiers 
source 1. .2156 

/organism="Rattus norvegicus" 
/mol_type= M mRNA" 
/db_xref="taxon: 10116" 
« gene 1. .2156 

/gene="Rtn4" 
5 ! UTR 1. .188 

/gene="Rtn4" 
CDS 189. .1271 

/gene="Rtn4" 

/note="alternatively spliced" 
/ codon_start=l 
/product="RTN4-Bl" 
/protein_id="AAP47315 . 1" 
/db_xref="GI : 32331281" 

/ translation="MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDE 
EEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAP 
ERQPSWERS PAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPST 
PAAPKRRGSGSVWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSV 
TISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRR 
LFLVDDLVDSLKFAVI^WVFTYVGALraGLTLLILALISLFSIPVIYERHQVQIDHYL 
GLANKSVKDAMAKIQAKI PGLKRKAD" 
3'UTR 1272. .2156 

/gene="Rtn4" 

ORIGIN 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 14, 2005, 15:51:07 ; Search time 646.602 Seconds 

(without alignments) 
3295.856 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



SEQ2 
1823 

1 MEDIDQSSLVSSSTDSPPRP. 



. VKDAMAKIQAKIPGLKRKAD 360 



Scoring table: 



BLOSUM62 
Xgapop 10 
Ygapop 10 
Fgapop 6 
Delop 6 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 



Searched: 



4390206 seqs, 2959870667 residues 



Total number of hits satisfying chosen parameters: 8780412 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Command line parameters : 
-MODEL=frame+_p2n.modei -DEV=xlh 
-Q=/ cgn 2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104545_3999/app_query. fasta_l. 1038 
-DB=N_Geneseq_16Dec04 -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_796_@runat_14042005_104545_3999 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREAD 3=1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 - YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : N_Geneseq__16Dec04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3: geneseqn2000s : * 

4: geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002as : * 

7: geneseqn2002bs : * 

8: geneseqn2003as : * 

9 : geneseqn2003bs : * 

10: geneseqn2003cs : * 

11 : geneseqn2003ds : * 



12 : geneseqn2004as : * 
13: geneseqn2004bs : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



NO . 
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Length 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 14, 2005, 20:35:54 



Search time 434.896 Seconds 

(without alignments) 

1354.484 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



SEQ2 
1823 

1 MEDIDQSSLVSSSTDSPPRP 



VKDAMAKIQAKIPGLKRKAD 360 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , 
Ygapop 10.0 , 
Fgapop 6.0 , 
Delop 6.0 , 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



1202784 seqs, 818138359 residues 



Total number of hits satisfying chosen parameters: 



2405568 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat__14042005_104547_4031/app_query.fasta_l. 1038 
-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=rni -MINMATCH=0 . 1 -LOOPCL-0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct ~THR_MAX=100 -THR__MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 0 0 0000 0 0 0 
-USER=NICHOLS-09-830-672-ETJSION - @CGN_l_l_2 19_@runat_14 042005_1 0454 7_4 031 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : Issued Patents NA:* 



1 : /cgn2_6/ptodata/l/ina/5A_COMB. seq: * 

2 : /cgn2_6/ptodata/l/ina/5B_COMB. seq: * 

3 : /cgn2_6/ptodata/l/ina/6A_COMB . seq : * 

4 : /cgn2_6/ptodata/l/ina/6B_COMB. seq: * 

5: /cgn2_6/ptodata/l/ina/PCTUS_COMB. seq:* 

6 : /cgn2_6/ptodata/l/ina/backf ilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 

RESULT 1 

US-09-484-970B-106 

; Sequence 106, Application US/09484970B 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 15, 2005, 00:27:04 ; Search time 1596.28 Seconds 

(without alignments) 
1368.189 Million cell updates/sec 



Title: SEQ2 
Perfect score: 1823 

Sequence: 1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKI PGLKRKAD 360 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 5622541 seqs, 3033355566 residues 

Total number of hits satisfying chosen parameters: 11245082 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post- processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=f rame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104549_4109/app_query . fasta_l . 1038 
-DB=Published_Applications_NA -QFMT=fastap -SUFFIX-rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits - START =1 -END=-1 -MATRIX=blosum62 
-TRANS=human4 0.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 
-MAXLEN=2000000000 

-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_893_@runat_14042005_10454 9_4109 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : Published_Applications_NA: * 

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq:* 

2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:* 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq: * 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq: * 
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 
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6/ptodata/2/pubpna/US09C_PUBCOMB 
_ 6/ptodata/2/pubpna/US09_NEW_PUB.s 
"6/ptodata/2/pubpna/US10A_PUBCOMB. 
~6/ptodata/2/pubpna/US10B_PUBCOMB. 
~6/ptodata/2/pubpna/US10C_PUBCOMB. 
"6/ptodata/2/pubpna/US10D_PUBCOMB. 
"6/ptodata/2/pubpna/US10E_PUBCOMB. 
~6/ptodata/2/pubpna/US10F_PUBCOMB. 
~6/ptodata/2/pubpna/US10_NEW_PUB.s 
"6/ptodata/2/pubpna/USll_NEW_PUB.s 
~6/ptodata/2/pubpna/US60_NEW_PUB.s 
~6/ptodata/2/pubpna/US60_PUBCOMB.s 



seq: 
eq: * 
seq: 
seq: 
seq: 
seq: 
seq: 
seq: 
eq: * 
eq: * 
eq: * 
eq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 

RESULT 1 

US-10-205-194-165 

; Sequence 165, Application US/10205194 

; Publication No. US20030134301A1 

; GENERAL INFORMATION: 

; APPLICANT: Warner-Lambert Company 

; APPLICANT: Lee, Kevin 

; APPLICANT: Dixon, Alistair 

; APPLICANT: Brooksbank, Robert 

APPLICANT: Pinnock, Robert 
; TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
; FILE REFERENCE: WL-A-018201 
; CURRENT APPLICATION NUMBER: US/ 10/205 , 194 
; CURRENT FILING DATE: 5200-07-24 
; PRIOR APPLICATION NUMBER: GB 0118354.0 
; PRIOR FILING DATE: 2001-07-27 
; NUMBER OF SEQ ID NOS : 177 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 165 

LENGTH: 2782 

TYPE: DNA 
; ORGANISM: Rattus norvegicus 

FEATURE: 

; OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-165 



Alignment Scores: 



Pred. No. : 


3.27e-150 


Length : 


2782 


Score : 


1800.50 


Matches : 


359 


Percent Similarity: 


94.99% 


Conservative : 


1 


Best Local Similarity: 


94.72% 


Mismatches : 


0 


Query Match: 


98.77% 


Indels : 


19 


DB: 


15 


Gaps : 


1 



SEQ 2 (1-360) x US-10-205-194-165* (1-2782) 

Qy 1 MetGluAspIleAspGlnSerSerLeuValSerSerSerThrAspSerProProArgPro 20 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M 
Db 701 ATGGAAGAC AT AGAC C AGT C GT C GCT GGT CTCCTCGTC C AC G GACAGC C CGCCCCGGCCT 760 



Qy 

Db 



21 ProProAlaPheLysTyrGlnPheValThrGluProGluAspGluGluAspGluGluGlu 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
761 C C GC CCGCCTT CAAGT AC C AGTT C GT GAC GGAGC CC GAG GAC GAGGAGGAC GAGGAGGAG 820 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 14, 2005, 17:27:17 



Search time 3995.95 Seconds 

(without alignments) 

3429.256 Million cell updates/se 



Title: 

Perfect score: 
Sequence : 



SEQ2 
1823 

1 MEDIDQSSLVSSSTDSPPRP 



VKDAMAKIQAKI PGLKRKAD 360 



Scoring table : 



BLOSUM62 



Xgapop 10.0 
Ygapop 10.0 
Fgapop 6 . 0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



34239544 seqs, 19032134700 residues 



Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-M0DEL=f rame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104546_4021/app_query . fasta_l . 1038 
-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT-pto -NORM=ext -HEAPSIZE=500 -MINLEN-0 -MAXLEN=2 000000000 
-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_5533__@runat_14042005_10454 6_4 021 
-NCPU=6 -ICPU=3 -MO__MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS- 1 -XGAPOP=10 -XGAPEXT=0 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : EST:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb_estl : * 
gb_est2 : * 
gb_htc : * 
gb_est3:* 
gb_est4 : * 
gb_est5 : * 
gb_est6 : * 
gb_gssl : * 
gb_gss2 : * 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: April 14, 2005, 17:00:28 ; Search time 5404.99 Seconds 

(without alignments) 
3236.333 Million cell updates/sec 

Title: SEQ29 
Perfect score: 1850 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 361 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 4708233 seqs, 24227607955 residues 

Total number of hits satisfying chosen parameters: 9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104546_4009/app_query. fasta_l. 1038 

-DB=GenEmbl ~QFMT=fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=10.0 -THR_MIN=0 -ALIGN=15 - MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_6287_@runat_14 042005_104 546__4009 
-NCPU=6 -ICPU=3 -N0_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 
-FGAPOP=6 -FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



GenEmbl:* 


1 


gb ba : * 


2 


gb htg : * 


3 


gb in:* 


4 


gb om : * 


5 


gb o v : * 


6 


gb pat:* 


7 


: gb ph:* 


8 


gb pi : * 


9 


: gb pr: + 


10: gb_ro:* 


11: gb_sts:* 



12: gb_sy:* 
1 3 : gb_un : * 
14: gb_vi:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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2256 


10 


AF132046 


AF132046 


Rattus no 


29 


1521 


82. 


2 


2266 


10 


AY102282 


AY102282 


Mus muscu 


30 


1513.5 


81. 


8 


2209 


10 


AY102281 


AY102281 


Mus muscu 


31 


1503 


81. 


2 


1276 


10 


AY114153 


AY114153 


Mus muscu 


32 


1424.5 


77. 


0 


3576 


6 


AX766050 


AX766050 


Sequence 


33 


1424.5 


77. 


0 


3579 


6 


BD249446 


BD249446 


Protein s 


34 


1424.5 


77. 


0 


3579 


9 


HSA251383 


AJ251383 


Homo sapi 


35 


1424.5 


77. 


0 


3919 


6 


CQ829486 


CQ829486 


Sequence 


36 


1424.5 


77. 


0 


4053 


6 


AX195249 


AX195249 


Sequence 


37 


1424.5 


77. 


0 


4053 


9 


AB020693 


AB020693 


Homo sapi 


38 


1424.5 


77. 


0 


4166 


9 


AB040462 


AB040462 


Homo sapi 


39 


1424.5 


77. 


0 


4632 


9 


AF148537 


AF148537 


Homo sapi 


40 


1424.5 


77. 


0 


4789 


6 


CQ874017 


CQ874017 


Sequence 


41 


1424.5 


77. 


0 


4789 


9 


AY102279 


AY102279 


Homo sapi 


42 


1401 


75. 


7 


4093 


6 


BD270070 


BD270070 


Secreted 


43 


1396 


75. 


5 


4822 


6 


AR220865 


AR220865 


Sequence 


44 


1220.5 


66. 


0 


2934 


6 


CQ828004 


CQ828004 


Sequence 



45 1154 62.4 3489 6 AX766046 AX766046 Sequence 



RESULT 1 
BD249448 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



ALIGNMENTS 



BD249448 1122 bp DNA linear PAT 17-JUL-2003 

Protein similar to neuroendrocrine-specif ic protein, and encoding 
cDNA. 
BD249448 

BD249448.1 GI: 33059218 
JP 2002522016-A/3. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1122) 
Michalovich, D. and Prinjha,R.K. 

Protein similar to neuroendrocrine-specif ic protein, and encoding 

Patent: JP 2002522016-A 3 23-JUL-2002; 

SMITHKLINE BEECHAM PLC 

OS Homo s apiens ( human ) 

PN JP 2002522016-A/3 

PD 23-JUL-2002 

PF 21-JUL-1999 JP 2000561310 

PR 22-JUL-1998 GB 98 16024 . 5, 19- JUL-1999 GB 9916898.1 PI 

DAVID MICHALOVICH, RABINDER KUMAR PRINJHA 

PC 

C12N15/09,C07K14/47,C07K16/18, C12N1/15,C12N1/19,C12N1/21,C12N5/ PC 
10, 

PC C12P21/02,C12Ql/02,C12Ql/68,G01N33/15,G01N33/50,G01N33/53, PC 

G01N33/566// 
PC . C12P21/08,C12N15/00,C12N5/00 

CC Protein similar to neuroendrocrine-specif ic protein, and CC 

encoding cDNA 
FH Key Location/Qualif iers 

FT source 1. .1122 

FT /organism= 1 Homo sapiens (human)'. 

Location/Qualifiers 
1. .1122 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db xref="taxon: 9606" 



Alignment Scores: 



Pred. No . : 


1. 03e-65 


Length : 


1122 


Score : 


1829.00 


Matches : 


360 


Percent Similarity: 


96.51% 


Conservative : 


0 


Best Local Similarity: 


96.51% 


Mismatches : 


1 


Query Match: 


98.86% 


Indels : 


12 


DB: 


6 


Gaps : 


1 



SEQ29 (1-361) x BD249448 (1-1122) 

Qy 1 MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 20 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 14, 2005 f 15:51:07 ; Search time 648.398 Seconds 

(without alignments) 
3295.856 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



SEQ29 
1850 

1 MEDLDQSPLVSSSDSPPRPQ. 



. VKDAMAKIQAKI PGLKRKAE 361 



Scoring table: 



BLOSUM62 
Xgapop 10. 
Ygapop 10. 
Fgapop 6 . 
Delop 6. 



Xgapext 
Ygapext 
Fgapext 
Delext 



Searched: 



4390206 seqs, 2959870667 residues 



Total number of hits satisfying chosen parameters: 8780412 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Command line parameters : 
-MODEL=f rame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104545_3999/app_query. fasta_l. 1038 
-DB=N_Geneseq_16Dec04 -QFMT=fastap -SUFFIX=rng -MINMATCH— 0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MO DE= LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_796_@runat_14 042005_104 545_3999 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=3 0 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : N_Geneseq_16Dec04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3: geneseqn2000s : * 

4: geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002as : * 

7 : geneseqn2002bs : * 

8: geneseqn2003as : * 

9: geneseqn2003bs : * 

10: geneseqn2003cs : * 

11 : geneseqn2003ds : * 



12: geneseqn2004as : * 
13: geneseqn2004bs : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES ■ 

% 

Result Query 

No. Score Match Length DB ID Description 
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50. 
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Aai98079 Human neu 


37 
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50. 
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4 
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Aaf90323 Human NOG 


38 


927 


50. 


1 


770 


3 


AAA72983 


Aaa72983 Human NSP 


39 


927 


50. 


1 


799 


2 


AAV23695 


Aav23695 Human NSP 


40 


927 


50. 


1 


1213 


2 


AAX04379 


Aax04379 Human sec 


41 
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50. 
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12 


ADK14166 


Adkl4166 Human aut 
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12 
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AAV30920 


Aav30920 Human sec 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 14, 2005, 20:35:54 ; Search time 436.104 Seconds 

(without alignments) 
1354.484 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



SEQ29 
1850 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 361 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 



Searched: 



1202784 seqs, 818138359 residues 



Total number of hits satisfying chosen parameters: 2405568 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=f rame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO__spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104547_4031/app_query. fasta_l. 1038 
-DB=Issued_Patents_NA -QFMT-fastap -SUFFIX=rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER^NICHOLS-09-830-672-FUSION_@CGN_l_l_219_@runat_14042005_104547_4 031 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS =1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database 



Issued Patents NA: 
1 
2 
3 
4 
5 
6 



/cgn2_6/ptodata/l/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/5B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/PCTUS_COMB. seq: 
/ cgn2_6/ptodata/l/ina/backf ilesl . seq: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 








Description 




1 


1396 


75 


.5 


4822 


3 


us- 


09- 
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970B-106 


Sequence 


106, App 




2 
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2 
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08- 
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607-2 


Sequence 


2, Appli 
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016-3253 


Sequence 


3253, Ap 




4 
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09- 


023- 


655-382 


Sequence 


382, App 




5 


762 


41 


.2 


2069 


4 


us- 


09- 


949- 


016-3309 


Sequence 


3309, Ap 




6 


762 


41 


.2 


3202 


4 


us- 


09- 


949- 


016-1127 


Sequence 


1127, Ap 




7 


678 


36 


.6 


1766 


3 


us- 


09- 


149- 


476-254 


Sequence 


254, App 




8 


665 


35 


.9 


2664 


3 


us- 


09- 


149- 


476-255 


Sequence 


255, App 




9 


627 . 5 


33 


.9 


2262 


4 


us- 


09- 


949- 


016-2988 


Sequence 


2988, Ap 




10 


556 


30 


.1 


1095 


2 


us- 


08- 


700- 


607-4 


Sequence 


4, Appli 




11 


554 


29 


.9 


3517 


4 


us- 


09- 


799- 


451-111 


Sequence 


111, App 




12 


548 


29 


.6 


794 


3 


us- 


09- 


149- 


476-102 


Sequence 


102, App 




13 


529 


28 


.6 


2181 


4 


us- 


09- 


949- 


016-1419 


Sequence 


1419, Ap 




14 


518 


28 


.0 


2014 


4 


us- 


09- 


270- 


767-13561 


Sequence 


13561, A 




15 


347 


18 


.8 


441 


4 


us- 


09- 


513- 


999C-2227 


Sequence 


2227, Ap 




16 


344 . 5 


18 


.6 


454 


4 


us- 


09- 


621- 


976-7 40 


Sequence 


740, App 




17 


344 . 5 


18 


.6 


463 


4 


us- 


09- 


621- 


976-741 


Sequence 


741, App 




18 
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18 


.4 


601 


4 


us- 


09- 


949- 


016-117588 


Sequence 


117588, 




19 


341 


18 


.4 


601 


4 


us- 


09- 


949- 


016-117589 


Sequence 


117589, 




20 


341 


18 


.4 


42075 


4 


us- 


09- 


949- 


016-14995 


Sequence 


14995, A 




21 


299 


16 


.2 


443 


4 


us- 


09- 


513- 


999C-3784 


Sequence 


3784, Ap 




22 


284 . 5 


15 


.4 


135667 


4 


us- 


09- 


.949™ 


016-15051 


Sequence 


15051, A 




23 


284.5 


15 


.4 


152486 


4 


us- 


09- 


949- 


016-12869 


Sequence 


12869, A 




24 


279.5 


15 


.1 


601 


4 


us- 


09- 


949- 


016-40169 

\J J- \J ± \J A- \J ^ 


Sequence 


40169, A 


Q 


25 


279.5 


15 


.1 


601 


4 


us- 


09- 


949- 


016-119335 


Sequence 


119335, 




26 


276 


14 


. 9 


261 


2 


us- 


08- 


700- 


607-9 


Sequence 


9, Appli 




27 


271 


14 


.6 


13906 


4 


us- 


09- 


949- 


016-14730 


Sequence 


14730, A 




28 


256 


13 


.8 


200 


4 


us- 


09- 


513- 


999C-11198 


Sequence 


11198, A 




29 


252 . 5 


13 


.6 


15661 


4 


us- 


09- 


949- 


016-13161 


Sequence 


13161, A 




30 


226 


12 


.2 


2580 


3 


us- 


09- 


050- 


863-2 


Sequence 


2, Appli 


c 


31 


226 


12 


.2 


2580 


3 


us- 


09- 


359- 


081-2 


Sequence 


2, Appli 




32 


226 


12 


.2 


8705 


4 


us- 


09- 


647- 


344A-14 


Sequence 


14, Appl 




33 


226 


12 


.2 


9600 


3 


us- 


08- 


910- 


647-1 


Sequence 


1, Appli 


Q 


34 


226 


12 


.2 


9600 


3 


us- 


09- 


620- 


925-1 


Sequence 


1, Appli 


c 


35 


226 


12 


.2 


10596 


1 


us- 


07- 


884- 


811-15 


Sequence 


15, Appl 


c 


36 


226 


12 


.2 


10596 


1 


us- 


07- 


885- 


971-15 


Sequence 


15, Appl 


c 


37 


226 


12 


.2 


10596 


1 


us- 


•08- 


087- 


783A-15 


Sequence 


15, Appl 


c 


38 


226' 


12 


.2 


10596 


1 


us- 


08- 


194- 


088B-15 


Sequence 


15, Appl 


c 


39 


226 


12 


.2 


10596 


2 


us- 


08- 


■194- 


■087-15 


Sequence 


15, Appl 


c 


40 


226 


12 


.2 


10596 


5 


PCT 


'-US93-04648-15 


Sequence 


15, Appl 




41 


226 


12 


.2 


16080 


4 


us- 


■09- 


■724- 


566A-48 


Sequence 


48, Appl 




42 


226 


12 


.2 


16080 


4 


us- 


■09- 


•471- 


669A-48 


Sequence 


48, Appl 




43 


221 


11 


.9 


5452 


2 


us- 


■09- 


■130- 


■114-1 


Sequence 


1, Appli 


c 


44 


219 


11 


.8 


1926 


3 


us- 


■09- 


•249- 


585A-2 


Sequence 


2, Appli 


c 


45 


219 


11 


.8 


1926 


4 


us- 


■09- 


•410- 


399-3 


Sequence 


3, Appli 



ALIGNMENTS 

RESULT 1 

US-09-484-970B-106 

; Sequence 106, Application US/09484970B 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: April 15, 2005, 00:27:04 ; Search time 1600.72 Seconds 

(without alignments) 
1368.189 Million cell updates/sec 

Title: SEQ29 
Perfect score: 1850 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 361 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 5622541 seqs, 3033355566 residues 

Total number of hits satisfying chosen parameters: 11245082 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104549_4109/app_query. fasta_l. 1038 
-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 
-TRANS=human4 0.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -MODE=LOCAL -OUTFMT=pto -NORM~ext -HEAPSIZE=500 -MINLEN 
-MAXLEN=2000000000 

-USER=NICHOLS-09-830-672-FUSION_(§CGN_l_l_893_@runat__14 042005_104549_4109 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 ~WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0. 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT— 7 

Database : Published_Applications_NA: * 

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq: * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq : * 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB. seq:* 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq: * 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq: * 
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: * 



11 


/cgn2 


12 


/cgn2 


13 


/cgn2 


14 


/cgn2 


15 


/cgn2 


16 


/cgn2 


17 


/cgn2 


18 


/cgn2 


19 


/cgn2 


20 


/cgn2 


21 


/cgn2 


22 


/cgn2 



_6/ptodata/2/pubpna/US09C_PUBCOMB.seq: 
~6/ptodata/2/pubpna/US09_NEW_PUB. seq: * 
~6/ptodata/2/pubpna/US10A_PUBCOMB. seq: 
~6/ptodata/2/pubpna/US10B_PUBCOMB.seq: 
~6/ptodata/2/pubpna/US10C_PUBCOMB . seq : 
~6/ptodata/2/pubpna/US10D_PUBCOMB.seq: 
_6/ptodata/2/pubpna/US10E_PUBCOMB. seq: 
]6/ptodata/2/pubpna/US10F_PUBCOMB.seq: 
[6/ptodata/2/pubpna/US10_NEW_PUB. seq:* 
]6/ptodata/2/pubpna/USll_NEW_PUB. seq: * 
[6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
~6/ptodata/2/pubpna/US60_PUBCOMB. seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB 



ID 



Description 



1 


1829 


98 . 


9 


1122 


9 


US- 09-7 8 9-38 6-5 


Sequence 5, Appli 


2 


1829 


98 . 


9 


1610 


9 


US-09-765-205-5 


Sequence 5, Appli 


3 


1829 


98 . 


9 


2235 


14 


US-10-060-036-54 


Sequence 54 , Appl 


4 


1821 


98 . 


4 


2052 


18 


US-10-466-258-3 


Sequence 3, Appli 


5 


1545 


83. 


5 


2782 


15 


US- 10-2 05- 194 -165 


Sequence 165, App 


6 


1424.5 


77. 


0 


3579 


9 


US-09-789-386-1 


Sequence 1, Appli 


7 


1424 . 5 


77 . 


0 


3579 


9 


US-09-893-348-22 


Sequence 22, Appl 


8 


1424.5 


77. 


0 


3579 


17 


US-10-267-502-212 


Sequence 212, App 


9 


1424.5 


77. 


0 


3579 


18 


US-10-327-213-8 


Sequence 8, Appli 


10 


1424.5 


77. 


0 


3579 


18 


US-10-466-258-8 


Sequence 8, Appli 


11 


1424.5 


. 77. 


0 


3579 


18 


US-10-810-653-22 


Sequence 22, Appl 


12 


1424.5 


77. 


0 


4053 


9 


US-09-758-140-5 


Sequence 5, Appli 


13 


1424.5 


77. 


0 


4053 


9 


US-09-972-599A-5 


Sequence 5, Appli 


14 


1424.5 


77. 


0 


4053 


18 


US-10-717-597-310 


Sequence 310, App 


15 


1424.5 


77. 


0 


4632 


14 


US-10-060-036-53 


Sequence 53, Appl 


16 


1154 


62. 


4 


4684 


9 


US-09-893-348-17 


Sequence 17, Appl 


17 


1154 


62. 


4 


4684 


18 


US-10-810-653-17 


Sequence 17, Appl 


18 


1114 


60. 


2 


3492 


17 


US-10-267-502-214 


Sequence 214, App 


19 


942.5 


50. 


9 


1980 


17 


US-10-220-891-22 


Sequence 22, Appl 


20 


927 


50. 


1 


799 


17 


US-10-660-946-2 


Sequence 2, Appli 


21 


927 


50. 


1 


1160 


14 


US-10-175-523-156 


Sequence 156, App 


22 


927 


50. 


1 


1785 


17 


US-10-439-388-62 


Sequence 62, Appl 


23 


918 


49. 


6 


994 


11 


US-09-978-360A-110 


Sequence 110, App 


24 


914 


49. 


4 


2610 


17 


US-10-641-643-382 


Sequence 382, App 


25 


886 


47. 


9 


1798 


18 


US-10-466-258-10 


Sequence 10, Appl 


26 


867 


46. 


9 


1514 


9 


US-09-823-245A-349 


Sequence 34 9, App 


27 


762 


41. 


2 


2331 


17 


US-10-267-502-213 


Sequence 213, App 


28 


762 


41. 


2 


3202 


9 


US-09-95.4-456-210 


Sequence 210, App 


29 


762 


41. 


2 


3202 


17 


US-10-172-118-386 


Sequence 386, App 


30 


762 


41. 


2 


3202 


17 


US-10-342-887-386 


Sequence 386, App 


31 


762 


41. 


2 


3202 


18 


US-10-723-860-1480 


Sequence 1480, Ap 


32 


762 


41. 


2 


3202 


19 


US-10-843-641A-3237 


Sequence 3237, Ap 


33 


758.5 


41. 


0 


2343 


17 


US-10-267-502-215 


Sequence 215, App 


34 


755 


40. 


8 


3305 


18 


US-10-723-860-5926 


Sequence 5926, Ap 


35 


715.5 


38. 


7 


1502 


15 


US-10-205-219-94 


Sequence 94, Appl 



36 
37 
38 
39 
40 
41 
42 
43 
44 
45 



703 
699 
695 
678 
678 
678 
677 
674 
674 
672 



38.0 
37.8 
37.6 
36.6 
36.6 
36.6 
36.6 
36.4 
36.4 
36.3 



1520 
1473 

422 
1766 
1766 
1766 
1915 
1668 
2768 

422 



15 
15 
9 

10 
10 
17 
17 
9 

18 
9 



US-10-084-817-333 
US-10-205-194-128 

US-09-960-352-8477 
US-09-809-391-254 
US-09-882-171-254 
US-10-164-861-254 
US-10-276-774-980 

US-09-765-205-25 



US-10-723-860-6867 
US-09-960-352-11567 



Sequence 333 , App 
Sequence 128, App 

Sequence 8477, Ap 
Sequence 254, App 
Sequence 254, App 
Sequence 254, App 
Sequence 980, App 

Sequence 25, Appl 
Sequence 6867, Ap 

Sequence 11567, A 



ALIGNMENTS 



RESULT 1 
US-09-789-386-5 

; Sequence 5, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,386 

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 

; PRIOR FILING DATE: 1999-07-22 

; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 5 

LENGTH: 1122 

TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-789-386-5 



Alignment Scores: 








Pred. No. : 


5.13e-152 


Length : 


1122 


Score : 


1829.00 


Matches : 


360 


Percent Similarity: 


96.51% 


Conservative : 


0 


Best Local Similarity: 


96.51% 


Mismatches : 


1 


Query Match: 


98.86% 


Indels : 


12 


DB: 


9 


Gaps : 


1 



SEQ29 (1-361) x US-09-789-386-5 (1-1122) 



Qy 1 MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 20 

I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 
Db 1 ATGGAAGACCTGGACCAGTCTCCTCTGGTCTCGTCCTCGGACAGCCCACCCCGGCCGCAG 60 



Qy 

Db 



21 
61 



ProAlaPheLysTyrGlnPheValArgGluProGluAspGluGluGluGluGluGluGlu 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

C C CGC GT T CAAGT ACC AGT T C GT GAGGGAG C CC G AGGAC GAGGAGGAAGAAGAGGAGGAG 120 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 14, 2005, 17:27:17 



Search time 4007.05 Seconds 

(without alignments) 

3429.256 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



SEQ29 
1850 

1 MEDLDQSPLVSSSDSPPRPQ 



VKDAMAKIQAKIPGLKRKAE 361 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



34239544 seqs, 19032134700 residues 



Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 



Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=f rame+_p2n. model -DEV=xlh 
-Q=/cgn2_l/USPTO_spool/NICHOLS-09-830-672- 
FUSION/runat_14042005_104546_4021/app_query. fasta_l. 1038 
-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT-0 
-UNITS=bits - START- 1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSI ZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=NICHOLS-09-830-672-FUSION_@CGN_l_l_5533_@runat_14042005_104546_4021 
-NCPU=6 -ICPU=3 -NO_MMAP - LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 
-FGAPOP=6 -FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : EST:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb_estl : * 
gb_est2:* 
gb_htc : * 
gb__est3 : * 
gb_est4 : * 
gb_est5 : * 
gb_est6 : * 
gb_gssl : * 
gb_gss2 : * 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1485 . 5 


80 . 3 


3533 


3 


AK034902 


AK034902 


Mus muscu 


2 


1246 


67 . 4 


1097 


5 


BX439214 


BX439214 


BX439214 


3 


1167 


63. 1 


911 


1 


AL549191 


AL549191 


AL549191 


4 


1080 . 5 


58 . 4 


781 


4 


BI079496 


BI079496 


602876306 


5 


1030.5 


55. 7 


1002 


5 


BX462171 


BX462171 


BX462171 


6 


978 


52 . 9 


677 


7 


CN791158 


CN791158 


4125836 B 


7 


954 . 5 


51 . 6 


924 


5 


BU845601 


BU845601 


AGEN COURT 


8 


939 


50 . 8 


667 


7 


CN42 9712 


CN429712 


170006000 


9 


932 . 5 


50 . 4 


708 


4 


BI157842 


BI157842 


602923001 


10 


927 


50 . 1 


598 


7 


CF118424 


CF118424 


fs543.zl 


11 


927 


50 . 1 


600 


9 


AY4 04 970 


AY404970 


Homo sapi 


12 


927 


50 . 1 


650 


6 


CB215381 


CB215381 


NISC np05 


13 


927 


50 . 1 


672 


7 


CK977984 


CK977984 


4109431 B 


14 


927 


50 . 1 


682 


6 


CB162885 


CB162885 


K-EST0223 


15 


927 


50.1 


712 


7 


CK971318 


CK971318 


4087182 B 


16 ■ 


927 


50 . 1 


743 


6 


rm o?ri 7 

V LJ X \J C~ \J X / 


CD102817 


AGENCOURT 


17 


927 


50 . 1 


758 


4 


RG6974 36 


BG697436 


602660623 


18 


927 


50 . 1 


788 


1 


AT.S334 fil 


AL533461 


AL533461 


19 


927 


50 . 1 


843 


4 


rc;s70?31 

L-) vJ / UjI. J X 


BG570231 


602590632 


20 


927 


50 . 1 


849 


7 


CR765672 


CR765672 


DKFZp469C 


c 21 


927 


50 . 1 


875 


1 


ALS734 94 


AL573494 


AL573494 


22 


927 


50 . 1 


1540 


3 


1 ft 69 


CR611869 


f ull-leng 


23 


927 


50.1 


1785 


3 




AF077050 


Homo sapi 


24 


924 


49.9 


730 


1 


AU?97347 


AU297347 


AU297347 


25 


923 . 5 


49 . 9 


810 


4 


RT080P3? 

LJ X \J (J \J £. >J 


BI080232 


602876650 


26 


923 


49.9 


592 


7 


PM4 R?fiO? 


CN482802 


hw24hl2.y 


27 


921 


49.8 


634 






CB067821 


iq38a06.y 


28 


921 


49.8 


670 


7 


CV0300? 9 


CV030029 


9024 Full 


29 


921 


49.8 


960 


7 


' CN646472 


CN646472 


ILLUMIGEN 


30 


921 


49.8 


983 


7 


CN803408 


CN803408 


ILLUMIGEN 


31 


921 


49.8 


1031 


7 


CN647521 


CN647521 


ILLUMIGEN 


32 


915 


49.5 


805 


7 


C0735185 


C0735185 


SlLE04clO 


33 


915 


49.5 


958 


4 


BM801698 


BM801698 


AGENCOURT 


34 


914 


49.4 


679 


4 


BI149602 


BI149602 


602848410 


35 


914 


49.4 


757 


4 


BG715173 


BG715173 


602675631 


36 


913 


49.4 


779 


7 


CO401465 


CO401465 


AGENCOURT 


37 


913 


49.4 


1042 


7 


CN805577 


CN805577 


ILLUMIGEN 


38 


912 


49.3 


871 


6 


CD110203 


CD110203 


AGENCOURT 


39 


911 


49.2 


600 


9 


AY404972 


AY404972 


Mus muscu 


40 


908 


49.1 


1013 


7 


CO048918 


CO048918 


ILLUMIGEN 


41 


908 


49. 1 


1018 


7 


CN801888 


CN801888 


ILLUMIGEN 


42 


906 


49.0 


990 


4 


BI691132 


BI691132 


603314519. 


43 


905 


48.9 


794 


1 


AU080127 


AU080127 


AU080127 


44 


905 


48.9 


799 


1 


AU080133 


AU080133 


AU080133 


45 


904 


48.9 


695 


7 


CV077130 


CV077130 


AGENCOURT 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 11:35:57 ; Search time 82.5388 Seconds 

(without alignments) 
3746.799 Million cell updates/sec 

Title: US-09-830-972-29_COPY_990_1178 
Perfect score: 931 

Sequence: 1 SWDLLYWRDIKKTGWFGA VKDAMAKIQAKIPGLKRKAE 189 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 

Searched: 1202784 seqs, 818138359 residues 

Total number of hits satisfying chosen parameters: 2405568 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14 042005_094 8 4 4_18 372/app_query. fasta_l 
.3683 

-DB=Issued_Patents_NA -QFMT^f as tap -SUFFIX=rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS =human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US09830972_@CGN_l_l_255_@runat_14042005_094844_18372 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : Issued Patents NA:* 



/cgn2_6/ptodata/l/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/5B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/PCTUS_COMB.seq:' 
/cgn2_6/ptodata/l/ina/backfilesl . seq: " 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 





No . 


Score 


Match 


Length 


DB 


ID 








Description 




1 


931 


100. 


0 


4822 


3 


US- 


09- 


484- 


970B-106 


Sequence 


106, App 




2 


927 


99. 


6 


799 


2 


US- 


08- 


700- 


607-2 


Sequence 


2, Appli 




3 


927 


99. 


6 


1669 


4 


US- 


09- 


949- 


016-3253 


Sequence 


3253, Ap 




4 


914 


98. 


2 


2610 


4 


US- 


09- 


023- 


655-382 


Sequence 


382, App 




5 


682 


73. 


3 


2069 


4 


us- 


09- 


949- 


016-3309 


Sequence 


3309, Ap 






682 


73. 


3 


3202 


4 


us- 


09- 


949- 


016-1127 


Sequence 


1127, Ap 




7 


627 . 5 


67. 


4 


2262 


4 


us- 


09- 


949- 


016-2988 


Sequence 


2988, Ap 




8 


624 . 5 


67. 


1 


1766 


3 


us- 


09- 


149- 


476-254 


Sequence 


254, App 




9 


624 . 5 


67. 


1 


2664 


3 


us- 


09- 


149- 


476-255 


Sequence 


255, App 




10 


543 


58. 


3 


3517 


4 


us- 


09- 


799- 


451-111 


Sequence 


111, App 




11 


526 . 5 


56. 


6 


1095 


2 


us- 


08- 


700- 


607-4 


Sequence 


4, Appli 




12 


518 


55. 


6 


2014 


4 


us- 


09- 


270- 


767-13561 


Sequence 


13561, A 




13 


508 . 5 


54. 


6 


794 


3 


us- 


09- 


149- 


476-102 


Sequence 


102, App 




14 


473 . 5 


50. 


9 


2181 


4 


us- 


09- 


949- 


016-1419 


Sequence 


1419, Ap 




15 


34 1 


36. 


6 


601 


4 


us- 


09- 


949- 


016-117588 


Sequence 


117588, 




16 


34 1 


36. 


6 


601 


4 


us- 


09- 


949- 


016-117589 


Sequence 


117589, 




17 


341 


36. 


6 


42075 


4 


us- 


09- 


949- 


016-14995 


Sequence 


14995, A 




18 


286 


30. 


7 


443 


4 


us- 


09- 


513- 


999C-3784 


Sequence 


3784, Ap 






279.5 


30. 


0 


135667 
j- *j *j \j \j i 


4 


us- 


09- 


949- 


016-15051 

w -L \J J- V/ ^p/ -U 


Sequence 


15051, A 




20 


279.5 


30. 


0 


152486 


4 


us- 


09- 


949- 


016-12869 


Sequence 


12869, A 




21 


276.5 


29. 


,7 


601 


4 


us- 


09- 


949- 


016-40169 


Sequence 


40169, A 




22 


276.5 


29. 


,7 


601 


4 


us- 


09- 


949- 


016-119335 


Sequence 


119335, 






276 


29. 


,6 


261 


2 


us- 


08- 


700- 


607-9 


Sequence 


9, Appli 




24 


271 


29. 


,1 


13906 


4 


us- 


09- 


949- 


■016-14730 


Sequence 


14730, A 




25 


256 


27. 


.5 


200 


4 


us- 


09- 


513- 


999C-11198 


Sequence 


11198, A 




26 


236 . 5 


25. 


.4 


15661 


4 


us- 


■09- 


949- 


016-13161 


Sequence 


13161, A 




27 


144 


15. 


,5 


1125 


4 


us- 


■09- 


248- 


796A-1905 


Sequence 


1905, Ap 




28 


141 


15. 


.1 


601 


4 


us- 


■09- 


■949- 


016-40170 


Sequence 


40170, A 




29 


14 1 


15, 


.1 


601 


4 


us- 


09- 


•949- 


016-119336 


Sequence 


119336, 


Q 


30 


122 


13. 


. 1 


601 


4 


us- 


09- 


■949- 


016-117609 


Sequence 


117609, 




31 


111 


11, 


,9 


266 


4 


us- 


•09- 


313- 


•294A-703 


Sequence 


7 03, App 




32 


102 


11, 


,0 


601 


4 


us- 


•09- 


•949- 


■016-48087 


Sequence 


48087, A 




33 


100 


10, 


,7 


425 


3 


us- 


■08- 


905- 


■223-178 


Sequence 


178, App 




34 


95 


10. 


.2 


1722 


4 


us- 


•08- 


•956- 


171E-407 


Sequence 


4 07, App 


c 


35 


95 


10, 


.2 


1722 


4 


us- 


•08- 


■781- 


•986A-407 


Sequence 


407, App 




36 


89 


9, 


.6 


1828 


3 


us- 


■08- 


■487- 


•596-7 


Sequence 


7, Appli 




37 


89 


9, 


.6 


1828 


4 


us- 


-08- 


■660- 


■451A-7 


Sequence 


7, Appli 




38 


88.5 


9, 


.5 


4550 


3 


us- 


-09- 


-462- 


■136-1 


Sequence 


1, Appli 




39 


88.5 


9, 


.5 


4661 


4 


us- 


-09- 


-949- 


•016-4281 


Sequence 


4281, Ap 




40 


88.5 


9, 


.5 


4673 


4 


us- 


-09- 


-814- 


•915A-100 


Sequence 


100, App 




41 


88 


9, 


.5 


1273 


4 


us- 


-09- 


-949- 


•016-5725 


Sequence 


5725, Ap 




42 


87.5 


9, 


.4 


1779 


4 


us- 


-09- 


-134- 


-000C-2072 


Sequence 


2072, Ap 




43 


87.5 


9 


.4 


3156 


3 


us- 


-09- 


-134- 


■001C-2168 


Sequence 


2168, Ap 




44 


86 


9 


.2 


296 


4 


us- 


-09- 


-313- 


■294A-81 


Sequence 


81, Appl 




45 


85 


9 


. 1 


1302 


4 


us- 


-08- 


-956- 


-171E-322 


Sequence 


322, App 



ALIGNMENTS 



RESULT 1 

US-09-484-970B-106 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 22:42:49 ; Search time 290.373 Seconds 

(without alignments) 
3948.747 Million cell updates/sec 



Title: US-09-830-972-29_COPY_990_1178 
Perfect score: 931 

Sequence: 1 SWDLLYWRDIKKTGWFGA VKDAMAKI QAKI PGLKRKAE 189 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 



Searched: 



5622541 seqs, 3033355566 residues 



Total number of hits satisfying chosen parameters: 



11245082 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2__l/USPTO_spool_h/US09830972/runat__14042005_094 847_18466/app_query. fasta_l 
.3683 

-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 
-TRANS=human40.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -MO DE= LOCAL -OUTFMT=pto -NORM=ext -HEAPSI ZE=50 0 -MINLEN=0 
-MAXLEN=2 000000000 -USER=US09830972_@CGN_l_l_2092_@runat_14042005_094847_18466 
-NCPU=6 -ICPU=3 -NO_MMAP - LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 
-FGAPOP- 6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : Published_Applications_NA: * 

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:* 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:* 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq:* 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq : * 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq: * 
10 : /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB . seq : * 



11 


/cgn2 


12 


/ cgn2 


13 


/cgn2 


14 


/ cgn2 


15 


/cgn2 


16 


/cgn2 


17 


/cgn2 


18 


/cgn2 


19 


/cgn2 


20 


/cgn2 


21 


/cgn2 


22 


/cgn2 



6/ptodata/2/pubpna/US09C_PUBCOMB.seq: 
_ 6/ptodata/2/pubpna/US09_NEW_PUB.seq:* 
]6/ptodata/2/pubpna/US10A_PUBCOMB.seq: 
~6/ptodata/2/pubpna/US10B_PUBCOMB.seq: 
]6/ptodata/2/pubpna/US10C_PUBCOMB.seq: 
~6/ptodata/2/pubpna/US10D_PUBCOMB.seq: 
~6/ptodata/2/pubpna/US10E_PUBCOMB.seq: 

6/ptodata/2/pubpna/US10F_PUBCOMB.seq: 
~6/ptodata/2/pubpna/US10_NEW_PUB.seq:* 
~6/ptodata/2/pubpna/USll_NEW_PUB . seq: * 
_6/ptodata/2/pubpna/US60_NEW_PUB.seq:* 
_6/ptodata/2/pubpna/US60_J>UBCOMB.seq:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB 



ID 



Description 



1 


931 


100 


. 0 


3579 


9 


US-09-789-386-1 


Sequence 1, Appli 


2 


931 


100 


. 0 


3579 


9 


US-09-893-348-22 


Sequence 22, Appl 


3 


931 


100 


. 0 


3579 


17 


US-10-267-502-212 


Sequence 212, App 


4 


931 


100 


.0 


3579 


18 


US-10-327-213-8 


Sequence 8, Appli 


5 


931 


100 


.0 


3579 


18 


US-10-466-258-8 


Sequence 8, Appli 


6 


931 


100 


.0 


3579 


18 


US-10-810-653-22 


Sequence 22, Appl 


7 


931 


100 


.0 


4053 


9' 


US-09-758-140-5 


Sequence 5, Appli 


8 


931 


100 


.0 


4053 


9 


US-09-972-599A-5 


Sequence 5, Appli 


9 


931 


100 


.0 


4053 


18 


US-10-717-597-310 


Sequence 310, App 


10 


931 


100 


.0 


4632 


14 


US-10-060-036-53 


Sequence 53, Appl 


11 


927 


99 


.6 


799 


17 


US-10-660-946-2 


Sequence 2, Appli 


12 


927 


99 


.6 


1122 


9 


US-09-789-386-5 


Sequence 5, Appli 


13 


927 


99 


.6 


1160 


14 


US-10-175-523-156 


Sequence 156, App 


14 


927 


99 


.6 


1610 


9 


US-09-765-205-5 


Sequence 5, Appli 


15 


927 


99 


.6 


1785 


17 


US-10-439-388-62 


Sequence 62, Appl 


16 


927 


99 


.6 


2052 


18 


US-10-466-258-3 


Sequence 3, Appli 


17 


927 


99 


.6 


2235 


14 


US-10-060-036-54 


Sequence 54, Appl 


18 


921 


98 


.9 


1980 


17 


US-10-220-891-22 


Sequence 22, Appl 


19 


918 


98 


.6 


994 


11 


US-09-978-360A-110 


Sequence 110, App 


20 


914 


98 


.2 


2610 


17 


US-10-641-643-382 


Sequence 382, App 


21 


908 


97 


.5 


4684 


9 


US-09-893-348-17 


Sequence 17, Appl 


22 


908 


97 


.5 


4684 


18 


US-10-810-653-17 


Sequence 17, Appl 


23 


905 


97 


.2 


2782 


15 


US-10-205-194-165 


Sequence 165, App 


24 


904.5 


97 


.2 


3492 


17 


US-10-267-502-214 


Sequence 214, App 


25 


886 


95 


.2 


1798 


18 


US-10-466-258-10 


Sequence 10, Appl 


26 


867 


93 


.1 


1514 


9 


US-09-823-245A-349 


Sequence 349, App 


27 


695 


74 


.7 


422 


9 


US-09-960-352-8477 


Sequence 8477, Ap 


28 


682 


73 


.3 


1520 


15 


US-10-084-817-333 


Sequence 333, App 


29 


682 


73 


.3 


2331 


17 


US-10-267-502-213 


Sequence 213, App 


30 


682 


73 


.3 


3202 


9 


US-09-954-456-210 


Sequence 210, App 


31 


682 


73 


.3 


3202 


17 


US-10-172-118-386 


Sequence 386, App 


32 


682 


73 


.3 


3202 


17 


US-10-342-887-386 


Sequence 386, App 


33 


682 


73 


.3 


3202 


18 


US-10-723-860-1480 


Sequence 1480, Ap 


34 


682 


73 


.3 


3202 


19 


US-10-843-641A-3237 


Sequence 3237, Ap 


35 


682 


73 


.3 


3305 


18 


US-10-723-860-5926 


Sequence 5926, Ap 



36 


681 


73. 


1 


1502 


15 


US-10-205-219-94 


Sequence 94, Appl 


37 


681 


73. 


1 


2343 


17 


US-10-267-502-215 


Sequence 215, App 


38 


672 


72. 


2 


422 


9 


US-09-960-352-11567 


Sequence 11567, A 


39 


665 


71. 


4 


1473 


15 


US-10-205-194-128 


Sequence 128, App 


40 


624.5 


67. 


1 


711 


17 


US-10-408-967-1 


Sequence 1, Appli 


41 


624.5 


67. 


1 


1330 


15 


US-10-106-698-1945 


Sequence 1945, Ap 


42 


624.5 


67. 


1 


1656 


9 


US-09-729-674-19 


Sequence 19, Appl 


43 


624.5 


67. 


1 


1656 


18 


US-10-913-553-19 


Sequence 19, Appl 


44 


624.5 


67. 


1 


1668 


9 


US-09-765-205-25 


Sequence 25, Appl 


45 


624.5 


67. 


1 


1766 


10 


US-09-809-391-254 


Sequence 254, App 



ALIGNMENTS 



RESULT 1 
US-09-789-386-1 

; Sequence 1, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/ 09/7 89 , 38 6 

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 

PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 1 

LENGTH: 357 9 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-789-386-1 



Alignment Scores: 

Pred. No.: 4.02e-106 Length: 3579 

Score: 931.00 Matches: 189 

Percent Similarity: 100.00% Conservative: 0 

Best Local Similarity: 100.00% Mismatches: 0 

Query Match: 100.00% Indels: 0 

DB: 9 Gaps: 0 

US-09-830-972-29_COPY_990_1178 (1-189) x US-09-789-386-1 (1-3579) 

Qy 1 SerValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyValValPheGlyAla 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3010 T C AGTT GT T GAC CT CCT GT ACT GGAGAGAC AT TAAGAAGACT GGAGT G GT GT TT GGT GC C 3069 



Qy 21 SerLeuPheLeuLeuLeuSerLeuThrValPheSerlleValSerValThrAlaTyrlle 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I M I I I 
Db 3070 AGC CT AT TCCTGCTGCTTT CAT T GACAGT ATT CAGC AT T GT GAG C GT AAC AGC CT AC AT T 3129 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:34:27 ; Search time 1670.87 Seconds 

(without alignments) 
4305.640 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-09-830-972-29_COPY_990_1178 
931 

1 SWDLLYWRDIKKTGWFGA 



VKDAMAKIQAKI PGLKRKAE 189 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



34239544 seqs, 19032134700 residues 



Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 45 summaries' 

Command line parameters : 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool__h/US09830972/runat_14 042005_094843_18360/app_query. fasta_l 
.3683 

-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 - MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 00 0 00 0 00 0 
-USER=US09830972_@CGN_l_l_13747_@runat_14042005_094843_18360 -NCPU=6 -ICPU-3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS-1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : EST:* 



1 

2 
3 
4 
5 
6 
7 
8 
9 



gb_estl : * 
gb_est2:* 
gb_htc : * 
gb_est3 : * 
gb_est4 : * 
gb_est5 : * 
gb_est6 : * 
gb_gssl : * 
gb_gss2 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


927 


99. 


6 


598 


7 


CF118424 


CF118424 


fs543. zl 


2 


927 


99. 


6 


600 


9 


AY404970 


AY404970 


Homo sapi 


3 


927 


99. 


6 


650 


6 


CB215381 


CB215381 


NISC__np05 


4 


927 


99. 


6 


667 


7 


CN429712 


CN429712 


170006000 


5 


927 


99. 


6 


672 


7 


CK977984 


CK977984 


4109431 B 


6 


927 


99. 


6 


682 


6 


CB162885 


CB162885 


K-EST0223 


7 


927 


99. 


6 


712 


7 


CK971318 


CK971318 


4087182 B 


8 


927 


99. 


6 


743 


6 


CD102817 


CD102817 


AG EN COURT 


9 


927 


99. 


6 


758 


4 


BG697436 


BG697436 


602660623 


10 


927 


99. 


6 


788 


1 


AL533461 


AL533461 


AL533461 


11 


927 


99. 


6 


843 


4 


BG570231 


BG570231 


602590632 


12 


927 


99. 


6 


849 


7 


CR765672 


CR765672 


DKFZp469C 


13 


927 


99. 


6 


875 


1 


AL573494 


AL573494 


AL573494 


14 


927 


99. 


6 


1540 


3 


CR611869 


CR611869 


f ull-leng 


15 


927 


99. 


6 


1785 


3 


AF077050 


AF077050 


Homo sapi 


16 


924 


99. 


2 


730 


1 


AU297347 


AU297347 


AU297347 


17 


921 


98. 


9 


634 


6 


CB067821 


CB067821 


iq38a06.y 


18 


921 


98. 


9 


670 


' 7 


CV030029 


CV030029 


9024 Full 


19 


921 


98. 


9 


960 


7 


CN646472 


CN646472 


ILLUMIGEN 


20 


921 


98. 


9 


983 


7 


CN803408 


CN803408 


ILLUMIGEN 


21 


921 


98. 


9 


1031 


7 


CN647521 


CN647521 


ILLUMIGEN 


22 


915 


98. 


3 


805 


7 


C0735185 


C0735185 


SlLE04clO 


23 


915 


98. 


3 


958 


4 


BM801698 


BM801698 


AGENCOURT 


24 


914 


98. 


2 


757 


4 


BG715173 


BG715173 


602675631 


25 


913 


98. 


1 


924 


5 


BU845601 


BU845601 


AGENCOURT 


26 


913 


98. 


1 


1042 


7 


CN805577 


CN805577 


ILLUMIGEN 


27 


912 


98. 


0 


592 


7 


CN482802 


CN482802 


hw24hl2.y 


28 


912 


98. 


0 


871 


6 


. CD110203 


CD110203 


AGENCOURT 


29 


911 


97. 


9 


600 


9 


AY404972 


AY404972 


Mus muscu 


30 


911 


97. 


9 


679 


4 


BI149602 


BI149602 


602848410 


31 


911 


97. 


9 


708 


4 


BI157842 


BI157842 


602923001 


32 


908 


97. 


5 


779 


7 


CO401465 


CO401465 


AGENCOURT 


33 


908 


97. 


5 


1013 


7 


CO048918 


CO048918 


ILLUMIGEN 


34 


908 


97. 


5 


1018 


7 


CN801888 


CN8.0188 8 


ILLUMIGEN 


35 


905 


97. 


2 


794 


1 


AU080127 


AU080127 


AU080127 


36 


905 


97. 


2 


799 


1 


AU080133 


AU080133 


AU080133 


37 


904 


97. 


1 


695 


7 


CV077130 


CV077130 


AGENCOURT 


38 


904 


97. 


1 


758 


4 


BG740561 


BG740561 


602633075 


39 


904 


97. 


1 


791 


7 


CF977898 


CF977898 


F26A06_04 


40 


904 


97. 


1 


1081 


7 


CN641703 


CN641703 


ILLUMIGEN 


41 


902 


96. 


9 


747 


4 


BG623462 


BG623462 


602648520 


42 


900 


96. 


7 


585 


7 


C0259245 


C0259245 


4130644 B 


43 ' 


899 


96. 


6 


742 


4 


BI838242 


BI838242 


603083162 


44 


896 


96. 


2 


3533 


3 


AK034902 


AK034902 


Mus muscu 


45 


894 


96. 


0 


781 


4 


BI079496 


BI079496 


602876306 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:30:21 ; 



Search time 2310.43 Seconds 

(without alignments) 

3963.787 Million cell updates/sec 



Title: 

Perfect score : 
Sequence : 



US-09-830-972-29_COPY_990_1178 
931 

1 SWDLLYWRDI KKTGWFGA 



VKDAMAKIQAKIPGLKRKAE 189 



Scoring table : 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



4708233 seqs, 24227607955 residues 



Total number of hits satisfying chosen parameters: 



9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094842_1834 8/app_query. fasta__l 
.3683 

-DB=GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR__MIN=0 -ALIGN=15 - MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US09830972_@CGN_l_l_19954_@runat_14042005_094 842_1834 8 -NCPU=6 -ICPU=3 
-NO_MMAP - LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : GenEmbl:* 



1: 
2: 
3: 
4: 
5: 
6: 
7: 
8: 
9: 
10 



gb_ba : * 
gb__htg : * 
gb__in : * 
gb_om : * 
gb_ov: * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 



11: gb_sts:* 

12: gb_sy:* 

13: gb_un : * 

1 4 : gb_vi : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 

X 


9^1 

J ~J X 


100 . 


o 


2481 


9 


AF063601 


AF063601 


Homo sapi 


o 


9^1 

J -J X 


100 . 


o 


2883 


9 


AF320999 


AF320999 


Homo sapi 


-3 
O 


9^1 

_7 O X 


i on 

X \J \J . 


Q 


3491 


9 


^i.L J J J J J U 


AF333336 


Homo saoi 


A 
*i 


9^1 

_? O X 


i on 


o 


3576 




AX766050 


AX766050 


Sequence 


-J 


9^1 

-7 O X 


i on 


o 


3579 


5 


RD24 94 4 6 


BD24 94 46 


Protein s 


D 


9^1 

_? O X 


i on 


n 
\j 


^S79 


Q 

Zf 


HSA? SI ^8^ 


AJ251383 


Homo saoi 


7 


9^1 


inn' 

x vj u • 


0 


^91 9 

J J X -7 


a 
\j 


m8?94 86 


C0829486 


Secfuence 

U V-* \A X X V*** 


Q 

o 


9^1 


i on 

X U W • 


o 


*± U *J •J 


a 


AX 1 9S?49 


AX195249 


Seauence 

iw* \A v-^ X X \»-» ^— 


q 


9^1 


1 00 




4053 


9 


AR020693 


AB020693 


Homo saDi 


i n 


9^1 

_7 O X 


i no 


o 


4060 


9 


AY123250 


AY123250 


Homo sapi 


1 1 

X J. 


_7 O X 


i nn 

X V w • 


o 


4070 


9 


AY123249 


AY123249 


Homo sapi 


X 


9^1 


100 . 


o 


4093 


5 


BD270070 

l_?X/£. f \J \J 1 \J 


BD270070 


Secreted 




9^1 

^ -J X 


100 . 


o 


4102 


9 


AY123245 


AY123245 


Homo sapi 


1 4 

X *i 


^ ~J X 


100 . 


o 


4109 


9 


AY123248 


AY123248 


Homo sapi 


15 


931 


100 . 


o 


4123 


9 


AY123247 


AY123247 


Homo sapi 


16 


931 


100 . 


o 


4160 


9 


AY123246 


AY123246 


Homo sapi 


17 


931 


100 . 


o 


4166 


9 


AB040462 


AB040462 


Homo sapi 


18 


931 


100 . 


o 


4632 


9 


AF148537 


AF148537 


Homo sapi 


19 


931 


100 . 


o 


4789 


6 


CQ874017 


CQ874017 


Sequence 


20 


931 


100. 


o 


4789 


9 


AY102279 


AY102279 


Homo sapi 


21 


931 


100. 


o 


4822 


6 


AR220865 


AR220865 


Sequence 


22 


928 


99. 


7 


1784 


9 


BC016165 


BC016165 


Homo sapi 


23 


928 


99. 


7 


2389 


9 


AY102278 


AY102278 


Homo sapi 


24 


927 


99. 


6 


600 


9 


HSA251385 


AJ251385 


Homo sapi 


25 


927 


99. 


6 


799 


6 


AR028522 


AR028522 


Sequence 


26 


927 


99. 


6 


1079 


9 


BC007109 


BC007109 


Homo sapi 


27 


927 


99. 


6 


1122 


6 


BD249448 


BD249448 


Protein s 


28 


927 


99. 


6 


1122 


9 


HSA251384 


AJ251384 


Homo sapi 


29 


927 


99. 


6 


1151 


9 


BC001035 


BC001035 


Homo sapi 


30 


927 


99. 


6 


1213 


6 


BD194907 


BD194907 


86 human 


31 


927 


99. 


6 


1213 


6 


CQ855235 


CQ855235 


Sequence 


32 


927 


99. 


6 


1466 


9 


BC071848 


BC071848 


Homo sapi 


33 


927 


99. 


6 


1485 


9 


BC010737 


BC010737 


Homo sapi 


34 


927 


99. 


6 


1525 


9 


AK130812 


AK130812 


Homo sapi 


35 


927 


99. 


6 


1599 


6 


CQ769577 


CQ769577 


Sequence 


36 


927 


99. 


6 


1610 


6 


BD231889 


BD231889 


Bone marr 


37 


927 


99. 


6 


1617 


9 


AF087901 


AF087901 


Homo sapi 


38 


927 


99. 


6 


1619 


9 


AK129806 


AK129806 


Homo sapi 


39 


927 


99. 


6 


1654 


9 


BC012619 


BC012619 


Homo sapi 


40 


927 


99. 


6 


1668 


9 


BC026788 


BC026788 


Homo sapi 


41 


927 


99. 


6 


1691 


9 


AF132048 


AF132048 


Homo sapi 


42 


927 


99. 


6 


1698 


9 


BC014366 


BC014366 


Homo sapi 


43 


927 


99. 


6 


1700 


9 


AF177332 


AF177332 


Homo sapi 



44 927 99.6 1709 9 AB040463 

45 927 99.6 1721 4 AY164744 



AB040463 Homo sapi 
AY164744 Bos tauru 



RESULT 1 
AF063601 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



ALIGNMENTS 



AF063601 2481 bp mRNA linear PRI 02-JAN-2001 

Homo sapiens brain my04 3 protein mRNA, complete cds . 

AF063601 

AF063601 . 1 GI : 12002033 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 2481) 

Mao, Y.M. , Xie,Y. and Zheng, Z.H. 

Direct Submission 

Submitted ( 05-MAY-1998 ) Institute of Genetics, School of Life 
Science, Fudan University, 220 Handan Rd., Shanghai 200433, 
P.R.China 

Location/Qualif iers 
1. .2481 

/organism= M Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="011al2" 
/tissue_type= "brain" 
/dev_stage=" fetus" 
194. .2137 
/codon_start=l 

/product="brain my043 protein" 
/protein_id="AAG43160. 1" 
/db_xref="GI : 12002034 " 

/ trans la tion="MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES 
LYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYE 
SIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPENIMQLFKKQKLLIYLLHVDLIKET 
KLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDET 
VMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVST 
LSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKT 
DSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDD 
FSKNGSATSKVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAI 
FSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIALALLSVTIS 
FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFL 
VD DLVD S L KFAVLMWVFT YVGAL FN G LT L L I LALI SLFSVPVI YERHQAQI DHYLGLA 
NKNVKDAMAKIQAKI PGLKRKAE" 



ORIGIN 



Alignment Scores: 
Pred. No. : 
Score : 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 



5.99e-87 

931.00 

100.00% 

100.00% 

100.00% 

9 



Length: 24 81 

Matches: 189 

Conservative: 0 

Mismatches : 0 

Indels: 0 

Gaps : 0 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 11:29:46 ; Search time 267.442 Seconds 

(without alignments) 
4183.459 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-29_COPY_990_117 8 
931 

1 SWDLLYWRDIKKTGWFGA VKDAMAKIQAKIPGLKRKAE 189 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 



Searched: 



4390206 seqs, 2959870667 residues 



Total number of hits satisfying chosen parameters: 



8780412 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14 042005_094842_18342/app_query. fasta_l 
.3683 

-DB=N_Geneseq -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits - START- 1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -N0RM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_1955_@runat_14042005_094 842_18342 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 - TH READS =1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database : N_Geneseq__16Dec04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3: geneseqn2000s : * 

4: geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002as : * 

7: geneseqn2002bs : * 

8: geneseqn2003as : * 

9: geneseqn2003bs : * 
10: geneseqn2003cs : * 



11 

12 
13 



geneseqn2003ds : * 
geneseqn2004as : * 
geneseqn2004bs : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


931 


100 . 


o 


3579 


3 


AA7. S688 6 


Aaz568 8 6 Human MAG 


2 


931 


100 . 


o 


3579 


4 


AAF90324 


Aaf90324 Human NOG 


3 


931 


100 . 


o 


3579 




ARK901 34 


Abk90134 DNA encod 


4 


931 


100 . 


o 


3579 




ARN8 6601 


Abn86601 Human neu 


5 


931 


100 . 


o 


3579 


12 


ADO078 8 6 


Ado07 8 8 6 Human pol 




931 


100 . 


o 


3579 


12 


ADR1 396S 


Adrl39 65 Human NOG 


7 


931 


100 . 


o 


3833 


3 


AAD01174 


Aad01174 Bovine ne 


3 


931 


100 . 


o 


3919 


12 


ADP4 S S SO 


Adn4 S S 5 0 Human Nocr 


9 


931 


100 . 


o 


4053 


4 


AAS094 S3 


Aa s 0 9 4 5 3 Human cDN 


X \J 


_? -~> X 


i no 

X \J \J • 


o 


4053 


9 


• ACT8 1 04 8 


Arr8104 8 Human Noa 

AXv-* V^* KJ JL. v * U IX LAX L LCI X X 11 ^> 


X X 


_7 —> X 


100 . 


o 


4053 


12 


ADP1 3S74 


Adr>13574 Renal eel 


12 


931 


100 . 


o 


4093 


3 


A A A2 34 54 


Aaa23454 cDNA enco 


13 


931 


100 . 


o 


4632 




ARV9 4 68 0 


Abv94 68 0 Human nan 

1 LXw^ V _/ i W XI \ 4 1 L1CX X X k-/ X X 


14 


931 


100 . 


o 


4632 


10 


ADG32772 


Adg 32772 Human DNA 


15 


931 


100 . 


o 


4698 


8 


ABX34 563 


Abx3 4 563 Human mdd 


16 


931 


100 . 


o 


4789 


13 


ADR8 3534 


Adr83534 Human ret 


17 


931 


100 . 


o 


4822 


6 


ABS70449 


Abs70449 Human bon 


18 


928 


99 . 


7 


1758 


4 


AAF32725 


Aaf32725 Human sec 


19 


927 


99 . 




600 


4 


AAF90323 


Aaf90323 Human NOG 


20 


927 


99 . 


6 


770 


3 


AAA7 2983 


Aaa72983 Human NSP 


21 


927 


99. 


6 


799 


2 


AAV2 3695 


Aav23695 Human NSP 


22 


927 


99. 


6 


1122 


3 


AAZ 56888 


Aaz56888 Human MAG 


23 


927 


99. 


6 


1122 


4 


AAF90325 


Aaf90325 Human NOG 


24 


927 


99. 


6 


1213 


2 


AAX04379 


Aax04379 Human sec 


25 


927 


99 . 


6 


1216 


6 


ABA0 5903 


Aba 05 9 03 Human RTN 


26 


927 


99. 


6 


1599 


10 


ADI62860 


Adi628 60 Human apo 


27 


927 


99. 


6 


1610 


3 


AAZ36230 


Aaz36230 cDNA enco 


28 


927 


99. 


6 


1785 


12 


ADK14166 


Adkl4166 Human aut 


29 


927 


99. 


6 


2052 


6 


ABK90133 


Abk90133 DNA encod 


30 


927 


99. 


6 


2235 


6 


ABV94681 


Abv94681 Human pan 


31 


927 


99. 


6 


2240 


3 


AAC64406 


Aac64406 Human Nog 


32 


923 


99. 


1 


1694 


4 


AAK94408 


Aak94408 Human ful 


33 


923 


99. 


1 


1694 


12 


ADL31137 


Adl31137 Full leng 


34 


921 


98. 


9 


1980 


4 


AAI98079 


Aai98079 Human neu 


35 


918 


98. 


6 


991 


2 


AAX97587 


Aax97587 Extended 


36 


918 


98. 


6 


994 


12 


ADP18854 


Adpl8854 Human sec 


37 


914 


98. 


2 


2610 


11 


ADI31056 


Adi31056 Human cDN 


38 


911 


97. 


9 


734 


12 


ADF42781 


Adf42781 Mouse CYP 


39 


908 


97. 


5 


3492 


12 


ADP45571 


Adp45571 Rat NogoA 


40 


908 


97. 


5 


4684 


3 


AAD01173 


Aad01173 Rat neuri 


41 


908 


97. 


5 


4684 


6 


ABN86600 


Abn86600 Rat neuro 


42 


905 


97. 


2 


2782 


10 


ADB85284 


Adb85284 Rat fooce 


43 


904.5 


97. 


2 


3492 


12 


ADO07888 


Ado07888 Mouse pol 


44 


904 


97. 


1 


1568 


3 


AAD01175 


Aad01175 Rat neuri 



45 894.5 96.1 2386 2 AAV30920 



Aav30920 Human sec 



ALIGNMENTS 

RESULT 1 
AAZ56886 

ID AAZ56886 standard; DNA; 3579 BP. 
XX 

AC AAZ56886; 
XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide encoding DNA. 
XX 

KW MAGI protein; neuroendocrine-specif ic protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW. stroke; cytostatic; cerebroprotective; neuroprotective; ds . 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .3579 

FT /*tag= a 

FT /product= "MAGI polypeptide" 
XX 

PN WO200005364-A1. 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB002360 . 
XX 

PR 22-JUL-1998; 98GB-00016024 . 

PR 19-JUL-1999; 99GB-00016898 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR P-PSDB; AAY56967. 
XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders. 
XX 

PS Claim 5; Page 19-20; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:34:27 



Search time 10414.2 Seconds 

(without alignments) 

4305.640 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ 



VKDAMAKIQAKI PGLKRKAE 1178 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



34239544 seqs, 19032134700 residues 



Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042 005_094843_18360/app_query. fasta 
.3683 

-DB=EST -QFMT-fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN- 15 - MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US09830972_@CGN_l_l_13747_@runat_14042005_094843_18360 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=l 00 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : EST:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 



estl: * 
est2:* 
htc: * 
est3:* 
*est4:* 
est5: * 
est6:* 
gssl : * 
gss2 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution.. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1339 . 5 


22 . 6 


1024 


7 


CN643847 


CN643847 


ILLUMIGEN 


2 


1337 . 5 


22 . 6 


871 


5 


BQ719894 


BQ719894 


AG EN COURT 


3 


1337 


22 . 6 


1013 


7 


CN646105 


CN646105 


ILLUMIGEN 


4 


1314 


22 . 2 


898 


7 


CN641948 


CN641948 


ILLUMIGEN 


5 


1304 


22 . 0 


901 


7 


CN646421 


CN646421 


ILLUMIGEN 


6 


1291 . 5 


21.8 


954 


4 


BM470283 


BM470283 


AGEN COURT 


7 


1279 


21 . 6 


902 


7 


CN805514 


CN805514 


ILLUMIGEN 


8 


1224 


20.7 


1028 


4 


BM469346 


BM469346 


AGENCOURT 


g 


1215 


20.5 


746 


7 


CR745036 


CR745036 


CR745036 


10 


1202 . 5 


20.3 


969 


4 


BG706140 


BG706140 


602669409 


11 


1183 . 5 


20.0 


730 


4 


BI601346 


BI601346 


603245090 


12 


1153 


19 . 5 


699 


4 


BG819566 


BG819566 


602783209 


13 


1135 


19.2 


3533 


3 


AK034902 


AK034902 


Mus muscu 


14 


1122 . 5 


19 . 0 


896 


6 


CB204418 


CB204418 


AGENCOURT 


15 


1092 


18 . 4 


671 


7 


CR751085 


CR751085 


DKFZp459K 


16 


1091 


18 . 4 


842 


5 


BU709149 


BU709149 


UI-M-EW0- 


c 17 


1090 


18 . 4 


836 


5 


BX434 904 


BX434904 


BX434904 


18 


1087 . 5 


18 . 4 


863 


2 


BE783092 


BE783092 


601470682 


19 


1087 


18 . 4 


986 


5 


BU841009 


BU841009 


AGENCOURT 


20 


1067 


18 . 0 


742 


7 


CR751121 


CR751121 


DKFZp459L 


21 


1064 . 5 


18 . 0 


695 


7 


CR74994 0 


CR749940 


DKFZp459C 


22 


1063 


17 . 9 


822 


6 


CB521332 


CB521332 


UI-M-GH0- 


23 


1032 


17 . 4 


785 


6 


CA511870 


CA511870 


UI-R-FJ0- 


24 


1016. 5 


17 . 2 


778 


5 


BU709106 


BU709106 


UI-M-EW0- 


c 25 


1015 


17 . 1 


619 


1 


AI879953 


AI879953 


au49b09.x 


26 


1011 . 5 


17. 1 


803 


7 


CN539361 


CN539361 


UI-M-HU0- 


27 


998 . 5 


16. 9 


805 


4 


BI730192 


BI730192 


603349739 


c 28 


997 


16. 8 


611 


4 


BM669752 


BM669752 


UI-E-DX1- 


29 


994 . 5 


16.8 


626 


5 


BQ639205 


BQ639205 


hd32f03.y 


30 


976 


16.5 


779 


7 


CO401465 


CO401465 


AGENCOURT 


31 


971 


16. 4 


969 


5 


BU839934 


BU839934 


AGENCOURT 


32 


965 


16. 3 


580 


5 


BP255731 


BP255731 


BP255731 


33 


962 


16.2 


582 


5 


BP229805 


BP229805 


BP229805 


34 


961 


16.2 


613 


7 


CR630625 


CR630625 


DKFZp459B 


35 


960.5 


16.2 


632 


2 


BE254391 


BE254391 


601109150 


36 


956.5 


16.1 


714 


7 


CN532333 


CN532333 


UI-M-HQ0- 


37 


955 


16.1 


582 


5 


BP219890 


BP219890 


BP219890 


38 


955 


16.1 


1028 


5 


BU139629 


BU139629 


603134795 


39 


954 


16.1 


584 


5 


BP220494 


BP220494 


BP220494 


40 


954 


16.1 


592 


5 


BP307780 


BP307780 


BP307780 


41 


952.5 


16.1 


573 


5 


BP208477 


BP208477 


BP208477 


42 


951 


16.1 


589 


1 


AL705563 


AL705563 


DKFZp686I 


43 


949.5 


16.0 


772 


7 


CF948588 


CF948588 


UI-M-HJ0- 


44 


949 


16.0 


592 


7 


CN429731 


CN429731 


170004245 


45 


948 


16.0 


582 


5 


BP310483 


BP310483 


BP310483 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:30:21 ; Search time 2310.43 Seconds 

(without alignments) 
3963.787 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-09-830-972-2_COPY_975_1163 
925 

1 S WDLLYWRDI KKTGWFGA 



VKDAMAKIQAKI PGLKRKAD 189 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



4708233 seqs, 24227607955 residues 



Total number of hits satisfying chosen parameters: 



9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094842_18348/app_query. fasta_l 
.3683 

-DB-GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0* 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_19954_@runat_14042005_094842_18348 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK-100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



1 
2 
3 
4 
5 
6 
7 
8 
9 



gb_ba : * 
gb_htg : * 
gb_in : * 
gb__om : * 
gb_o v : * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 



10: 



11: gb_sts:* 
12: gb_sy:* 
1 3 : gb_un : * 
14: gb_vi:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES . 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


925 


100 


. 0 


3489 


6 


AX766046 


AX766046 Sequence 


2 


925 


100 


. 0 


. 3492 


6 


CQ829507 


CQ829507 Sequence 


3 


925 


100 


. 0 


4684 


10 


RN0242961 


AJ242961 Rattus no 


4 


922 


99 


. 7 


2782 


6 


AX700396 


AX700396 Sequence 


5 


922 


99 


.7 


2782 


10 


AY164741 


AY164741 Rattus no 


6 


922 


99 


.7 


2782 


10 


AF132045 


AF132045 Rattus no 


7 


921 


99 


. 6 


764 


10 


AF051335 


AF051335 Rattus no 


8 


921 


99 


. 6 


1568 


10 


RN0242963 


AJ242963 Rattus no 


9 


921 


99 


. 6 


2156 


10 


AY164740 


AY164740 Rattus no 


10 


921 


99 


. 6 


2156 


10 


RN0242962 


AJ242962 Rattus no 


11 


921 


99 


. 6 


2410 


10 


BC070879 


BC070879 Rattus no 


12 


917 


99 


. 1 


1738 


10 


AB073672 


AB073672 Mus muscu 


13 


917 


99 


. 1 


2958 


10 


BC032192 


BC032192 Mus muscu 


14 


917 


99 


. 1 


3815 


10 


BC032272 


BC032272 Mus muscu 


15 


917 


99 


. 1 


4063 


10 


AY102280 


AY102280 Mus muscu 


16 


917 


99 


. 1 


4518 


10 


BC056373 


BC056373 Mus muscu 


17 


917 


99 


. 1 


4627 


10 


AY102284 


AY102284 Mus muscu 


18 


914 


98 


. 8 


2266 


10 


AY102282 


AY102282 Mus muscu 


19 


913 


98 


. 7 


734 


10 


AF326337 


AF32 6337 Mus muscu 


20 


. 913 


98 


. 7 


1769 


10 


AY102283 


AY1022 83 Mus muscu 


21 


913 


98 


. 7 


2209 


10 


AY1022 81 


AYlOz^ol Mus muscu 


22 


908 


98 


. 2 


2481 


9 


AF063601 


AF063601 Homo sapi 


23 


908 


98 


. 2 


2883 


9 


tv n *\ o r\ c\ c\ r\ 

AF32Q999 


Aro^iuyyy nomo sapi 


24 


908 


98 


. 2 


3491 


9 


AF333336 


AF333336 Homo sapi 


25 


908 


98 


.2 


3576 


6 


AX766050 


AX766050 Sequence 


26 


908 


98 


.2 


3579 


6 


BD249446 


BD249446 Protein s 


27 


908 


98 


.2 


3579 


9 


HSA251383 


AJ251383 Homo sapi 


28 


908 


98 


.2 


3919 


6 


CQ829486 


CQ829486 Sequence 


29 


908 


98 


.2 


4053 


6 


AX195249 


AX195249 Sequence 


30 


908 


98 


.2 


4053 


9 


AB020693 


AB020693 Homo sapi 


31 


908 


98 


.2 


4060 


9 


AY123250 


AY123250 Homo sapi 


32 


908 


98 


.2 


4070 


9 


AY123249 


AY12324 9 Homo sapi 


33 


908 


98 


.2 


4093 


6 


BD270070 


BD270070 Secreted 


34 


908 


98 


.2 


4102 


9 


AY123245 


AY123245 Homo sapi 


35 


908 


98 


.2 


4109 


9 


AY123248 


AY123248 Homo sapi 


36 


908 


98 


.2 


4123 


9 


AY123247 


AY123247 Homo sapi 


37 


908 


98 


.2 


4160 


9 


AY123246 


AY123246 Homo sapi 


38 


908 


98 


.2 


4166 


9 


AB040462 


AB040462 Homo sapi 


39 


908 


98 


.2 


4632 


9 


AF148537 


AF148537 Homo sapi 


40 


908 


98 


.2 


4789' 


6 


CQ874017 


CQ874017 Sequence 


41 


908 


98 


.2 


4789 


9 


AY102279 


AY102279 Homo sapi 


42 


908 


98 


.2 


4822 


6 


AR220865 


AR220865 Sequence 


43 


906.5 


98 


.0 


3821 


10 


AY114152 


AY114152 Mus muscu 



44 905 97.8 1784 9 BC016165 

45 905 97.8 2389 9 AY102278 



BC016165 Homo sapi 
AY102278 Homo sapi 



ALIGNMENTS 



RESULT 1 
AX766046 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



exon 
ORIGIN 



linear PAT 25-JUN-2003 



AX766046 3489 bp DNA 

Sequence 1 from Patent WO03002602. 
AX766046 

AX766046.1 GI: 32260128 



Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 
1 

Eisenbach-Schwartz, M. and Hauben,E. 

Nogo and nogo receptor derived peptides for t-cell mediated 
neuroprotection 

Patent: WO 03002602-A 1 09-JAN-2003; 

YEDA RESEARCH AND DEVELOPMENT Co. LTD. (IL) 

Location/ Qualifiers 

1. .3489 

/organism="Rattus norvegicus" 
/mol_type="unas signed DNA ,f 
/db_xref="taxon: 10116" 
1. .3489 



Alignment Scores: 

Pred. No. : 2. 82e-85 

Score: 925.00 

Percent Similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 100.00% 

DB: 6 



Length: 3489 

Matches: 189 

Conservative: 0 

Mismatches: 0 

Indels : 0 

Gaps : 0 



US-09-830-972-2_COPY_975_1163 (1-189) x AX766046 (1-3489) 

Qy 1 SerValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyValValPheGlyAla 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 2923 TCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCC 2982 

Qy 21 SerLeuPheLeuLeuLeuSerLeuThrValPheSerlleValSerValThrAlaTyrlle 40 

I I LI I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I II II I II M I II I I 
Db 2983 AGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATT 3042 

Qy 41 AlaLeuAlaLeuLeuSerValThrlleSerPheArglleTyrLysGlyVallleGlnAla 60 

I I I I I I I II I I II I I I I I II I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I II I I II I 
Db 3043 GCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCT 3102 



Qy 61 IleGlnLysSerAspGluGlyHisProPheArgAlaTyrLeuGluSerGluValAlalle 80 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 3103 AT C CAGAAATCAGAT GAAGGC CAC C C ATT C AGGGCAT AT T T AGAAT CT GAAGTT GCT AT A 3162 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 11:29:46 ; Search time 267.442 Seconds 

(without alignments) 
4183.459 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-2_COPY_975_1163 
925 

1 S WDLLYWRDI KKTGWFGA 



VKDAMAKIQAKIPGLKRKAD 189 



Scoring table: 



BLOSUM62 



Xgapop 10.0 
Ygapbp 10.0 
Fgapop 6 . 0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



4390206 seqs, 2959870667 residues 



Total number of hits satisfying chosen parameters: 



8780412 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model ~DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094842_18342/app_query. fasta_l 
.3683 

-DB=N_Geneseq -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS -human 4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_1955_@runat_14042005_094 842_18342 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : N_Geneseq_16Dec04 : * 



1: 
2: 
3: 
4: 
5: 
6: 
7: 
8: 
9: 
10 



geneseqnl980s : * 
geneseqnl990s : * 
geneseqn2000s : * 
geneseqn2001as : * 
geneseqn2001bs : * 
geneseqn2002as : * 
geneseqn2002bs : * 
geneseqn2003as : * 
geneseqn2003bs : * 
geneseqn2003cs : * 



11 : geneseqn2003ds : * 
12 : geneseqn2004as : * 
13 : geneseqn2004bs : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


925 


100. 


0 


3492 


12 


ADP45571 


Adp45571 Rat NogoA 


2 


925 


100. 


0 


4684 


3 


AAD01173 


Aad01173 Rat neuri 


3 


925 


100. 


0 


4684 


6 


ABN86600 


Abn86600 Rat neuro 


4 


922 


99. 


7 


2782 


10 


ADB85284 


Adb85284 Rat fooce 


5 


921 


99. 


6 


1568 


3 


AAD01175 


Aad01175 Rat neuri 


6 


913 


98. 


7 


734 


12 


ADF42781 


Adf42781 Mouse CYP 


7 


908 


98. 


2 


3579 


3 


AAZ56886 


Aaz56886 Human MAG 


8 


908 


98. 


2 


3579 


4 


AAF90324 


Aaf90324 Human NOG 


9 


908 


98. 


2 


3579 


6 


ABK90134 


Abk90134 DNA encod 


10 


908 


98. 


2 


3579 


6 


ABN86601 


Abn86601 Human neu 


11 


908 


98. 


2 


3579 


12 


ADO07886 


Ado07886 Human pol 


12 


908 


98. 


2 


3579 


12 


ADR13965 


Adrl3965 Human NOG 


13 


908 


98. 


2 


3833 


3 


AAD01174 


Aad01174 Bovine ne 


14 


908 


98. 


2 


3919 


12 


ADP45550 


Adp45550 Human Nog 


15 


908 


98. 


2 


4053 


4 


AAS09453 


Aas09453 Human cDN 


16 


908 


98. 


2 


4053 


9 


ACC81048 


Acc81048 Human Nog 


17 


908 


98. 


2 


4053 


12 


ADP13574 


Adpl3574 Renal eel 


18 


908 


98. 


2 


4093 


3 


AAA23454 


Aaa23454 cDNA enco 


19 


908 


98. 


2 


4632 


6 


ABV94680 


Abv94 680 Human pan 


20 


908 


98. 


2 


4632 


10 


ADG32772 


Adg32772 Human DNA 


21 


908 


98. 


2 


4698 


8 


ABX34563 


Abx34563 Human mdd 


22 


908 


98. 


2 


4789 


13 


ADR83534 


Adr83534 Human ret 


23 


908 


98. 


2 


4822 


6 


ABS70449 


Abs70449 Human bon 


24 


906. 5 


98. 


0 


3492 


12 


ADO07888 


Ado07888 Mouse pol 


c 25 


905 


97. 


8 


1758 


4 


AAF32725 


Aaf32725 Human sec 


26 


904 


97. 


7 


600 


4 


AAF90323 


Aaf90323 Human NOG 


27 


904 


97. 


7 


770 


3 


AAA7 2983 


Aaa72983 Human NSP 


28 


904 


97. 


7 


799 


2 


AAV23695 


Aav23695 Human NSP 


29 


904 


97. 


7 


1122 


3 


AAZ56888 


Aaz56888 Human MAG 


30 


904 


97. 


7 


1122 


4 


AAF90325 


Aaf90325 Human NOG 


31 


904 


97. 


7 


1213 


2 


AAX04379 


Aax04379 Human sec 


32 


904 


97. 


7 


1216 


6 


ABA05903 


Aba05903 Human RTN 


33 


904 


97. 


7 


1599 


10 


ADI62860 


Adi62860 Human apo 


34 


904 


97. 


7 


1610 


3 


AAZ36230 


Aaz36230 cDNA enco 


35 


904 


97. 


7 


1785 


12 


ADK14166 


Adkl4166 Human aut 


36 


904 


97. 


7 


2052 


6 


ABK90133 


Abk90133 DNA encod 


37 


904 


97. 


7 


2235 


6 


ABV94681 


Abv94681 Human pan 


38 


904 


97. 


7 


2240 


3 


AAC64406 


Aac64 4 06 Human Nog 


39 


900 


97. 


3 


1694 


4 


AAK94408 


Aak94408 Human ful 


40 


900 


97. 


3 


1694 


12 


ADL31137 


Adl31137 Full leng 


41 


898 


97. 


1 


1980 


4 


AAI98079 


Aai98079 Human neu 


42 


895 


96. 


8 


991 


2 


AAX97587 


Aax97587 Extended 


43 


895 


96. 


8 


994 


12 


ADP18854 


Adpl8854 Human sec 


44 


891 


96. 


3 


2610 


11 


ADI31056 


Adi31056 Human cDN 



45 871.5 94.2 2386 2 AAV30920 



Aav30920 Human sec 



ALIGNMENTS 

RESULT 1 
ADP45571 

ID ADP45571 standard; cDNA; 3492 BP. 
XX 

AC ADP45571; 
XX 

DT 09-SEP-2004 (first entry) 
XX 

DE Rat NogoA encoding cDNA SEQ ID NO: 25. 
XX 

KW binding molecule; human; NogoA; NiG; NiG-D20; NogoA__623-640 ; 

KW nerve repair; neuroprotective; gene therapy; 

KW central nervous system injury; CNS injury; neurodegenerative disorder; 

KW rat; gene; ss. 

XX 

OS Rattus norvegicus . 
XX 

FH Key Location/Qualif iers 

FT CDS 1. ..3492 

FT /*tag= a 

FT /product^ "NogoA" 

XX 

PN WO2004052932-A2. 
XX 

PD 24-JUN-2004. 
XX 

PF 09-DEC-2003; 2003WO-EP013960 . 
XX 

PR 10-DEC-2002; 2002GB-00028832 . . 
XX 

PA (NOVS ) NOVARTIS AG. 

PA (NOVS ) NOVARTIS PHARMA GMBH. 

PA (UYZU-) UNIV ZUERICH. 

XX 

PI Barske C, Mir AK, Oertle T, Schnell L, Schwab ME, Vitaliti A; 

PI Zurini M; 

XX 

DR WPI; 2004-468818/44. 

DR P-PSDB; ADP45572. 
XX 

PT New binding molecule that binds to the human NogoA polypeptide, NiG, NiG- 

PT D20 or NogoA623-640, useful in preparing a composition for treating CNS 

PT injury or neurodegenerative disorders. 
XX 

PS Disclosure; SEQ ID NO 25; 121pp; English. 
XX 

CC The present invention describes a binding molecule which binds to human 

CC NogoA polypeptide, human NiG, human NiG-D20 or human NogoA_623-64 0 with a 

CC dissociation constant of less than lOOOnM. Also described: (1) a 

CC polynucleotide encoding the binding molecule; (2) an expression vector or 

CC system comprising the polynucleotide; (3) a host cell comprising the 

CC expression system; (4) a pharmaceutical composition comprising the 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 11:35:57 ; Search time 82.5388 Seconds 

(without alignments) 
3746.799 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-09-830-972-2_COPY_975_1163 
925 

1 SWDLLYWRDIKKTGWFGA VKDAMAKIQAKI PGLKRKAD 189 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7.0 
Delop 6.0 , Delext 7.0 



Searched : 



1202784 seqs, 818138359 residues 



Total number of hits satisfying chosen parameters: 2405568 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094844_18372/app_query. fasta_l 
.3683 

-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE-LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN—0 -MAXLEN=2000000000 
-USER=US09830972_@CGN_l_l_255_@runat_14042005_094844_18372 -NCPU-6 -ICPU-3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database 



Issued Patents NA:* 
1 
2 
3 
4 
5 
6 



/cgn2_6/ptodata/l/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/5B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/PCTUS_COMB. seq: 
/cgn2_6/ptodata/l/ina/backf ilesl . seq: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 







Score 


Match 


T a. >*» V\ 

Lengufi 


JJD 


t n 

J. L> 








Description 




1 

1 


q n q 


98. 


2 


A QOO 
4 OZZ 


3 


no 
Uo — 


uy — 


A Q A _ 
4 o4 — 


y / uo iuo 


Sequence 


106, App 






q n a 
y U4 


97. 


7 


/ y y 




TTC* 

Uo — 


n q _ 

Uo 


7nn_ 
/ uu 


DU / 6. 


Sequence 


2, Appli 




3 


y U4 


97. 


7. 


i boy 


yi 
4 


Uo- 


uy- 


Q A Q 

y 4 y — 




Sequence 


3253, Ap 




4 


£3 G 1 
Q 91 


96. 


3 


z oil) 


4 


TTC 

Uo- 


uy- 


HOT 


boo— 3 


Sequence 


382, App 




0 


/* *7 Q 

b / y 


73. 


4 


ziUoy 


yi 
4 


TT C 

Uo — 


uy — 


y 4 y — 


u i b oouy 


Sequence 


3309, Ap 




6 


6/9 


73. 


4 


3^ UZ 


A 

4 


TTC* 

Uo- 


uy- 


Q /I Q 

y 4 y- 


m c ii oi 
Ulb— ll*i / 


Sequence 


1127, Ap 




/ 


625 . 5 


67. 


6 


2z 62 


4 


T T C 

Uo- 


uy- 


y 4y- 


nit: oqqo 
U 1 b— z y 0 0 


Sequence 


2988, Ap 




8 


622 . 5 


67. 


3 


1/66 


3 


us- 


Uy- 


149- 


4 / b~zo4 


Sequence 


254, App 




9 


f o o c 

622 . 5 


67. 


3 


'-i f f A 

2664 


3 


US- 


09- 


149- 


ah c o c: c 
4 / b-^oo 


Sequence 


255, App 




10 


C yl 1 

541 


58. 


5 


351 / 


yt 
4 


US- 


uy- 


t n q 

/ y y- 


yl C 1 111 

4 01—111 


Sequence 


111, App 




11 


524.5 


56. 


7 


t n n c 
1095 


2 


us- 


n o 

Oo- 


t n n 

/00- 


c c\ n a 
oU /-4 


Sequence 


4, Appli 




12 


518 


56. 


0 


yt 

2014 


4 


us- 


09- 


o t n 

2 /0- 


/ o /-13obl 


Sequence 


13561, A 




13 


509.5 


55. 


1 


/ 94 


3 


us- 


09- 


T yl n 

149- 


4 / b-lUZ 


Sequence 


102, App 




14 


4 /5 


51. 


4 


2181 


yi 
4 


us- 


09- 


y4y- 


01b-14iy 


Sequence 


1419, Ap 


c 


Id 


341 


36. 


9 


601 


yi 
4 


us- 


uy- 


y4y- 


Ulb-11 / 00 0 


Sequence 


117588, 


c 


16 


341 


36. 


9 


601 


4 


US- 


09- 


949- 


016-11 /oo9 


Sequence 


117589, 




17 


341 


36. 


9 


42075 


4 


US- 


09- 


949- 


016-14995 


Sequence 


14995, A 




18 


286 


30. 


9 


443 


4 


US- 


09- 


513- 


999C-3 / 84 


Sequence 


3784, Ap 




19 


282.5 


30. 


5 


13566 / 


4 


US- 


09- 


949- 


016- loOol 


Sequence 


15051, A 




20 


282 . 5 


30. 


5 


152486 


A 

4 


us- 


09- 


fi yi n 
949- 


016-128 69 


Sequence 


12869, A 


c 


21 


278 . 5 


30. 


1 


601 


4 


US- 


09- 


949- 


Air yini/TO 

016-40169 


Sequence 


40169, A 


c 


22 


278.5 


30. 


1 


601 


A 

4 


US- 


09- 


n yi q 

949- 


01b-liy33o 


Sequence 


119335, 




23 


276 


29. 


8 


261 


2 


US- 


n o 

08- 


700- 


60 /-9 


Sequence 


9, Appli 




24 


271 


29. 


3 


i *3 n f\ a 

13906 


4 


us- 


09- 


C\ A ft 

949- 


016-14 /3U 


Sequence 


14730, A 




25 


248 


26. 


8 


15661 


4 


US- 


09- 


n yi o 

949- 


016-13161 


Sequence 


13161, A 




26 


o yi o 

243 


26. 


3 


o a r\ 
200 


yl 
4 


US- 


09- 


013- 


QQO/"« 111QO 

y y yc- ii iy o 


Sequence 


11198, A 


c 


27 


146 


15. 


8 


601 


4 


us- 


09- 


949- 


016-4017 0 


Sequence 


40170, A 


c 


28 


14 6 


15. 


8 


601 


4 


us- 


09- 


949- 


016-119336 


Sequence 


119336, 




29 


142 


15. 


4 


1125 


4 


US- 


09- 


O A O 

248- 


7 9 6 A- 1905 


Sequence 


1905, Ap 


c 


30 


114 


12. 


3 


601 


4 


US- 


uy- 


949- 


A1C IITCOO 

Ulb-11 / buy 


Sequence 


117609, 




31 


109 


11. 


8 


1 c tz 

2 66 


yl 
4 


us- 


uy- 


313- 


O O /I 7\ TOO 

zy4A— /U3 


Sequence 


703, App 


c 


32 


102 


11. 


0 


601 


4 


us- 


uy- 


y4y- 


Ulb-4oU0 / 


Sequence 


48087, A 




33 


100 


. 10. 


8 


/IOC 

425 


•"5 

3 


us- 


Uo- 


r\ c; 


■^Z3-1 /0 


Sequence 


178, App 




34 


95 


10. 


3 


"1 o o o 
loz 0 


3 


US- 


r* o 
Uo- 


•4 0/- 


oyb— / 


Sequence 


7, Appli 




35 


yo 


10. 


3 


i o o o 


yt 
4 


US- 


Uo- 


fobU- 


■ 4 0 1 A— / 


Sequence 


7, Appli 




O D 


CM 

y 4 


10. 


2 


1Z / -5 


ft 


TTQ — 

u o 




. Q A Q_ 
ft 


UIO 0/^.0 


Sequence 


5725, Ap 


c 


37 


94 


10. 


2 


1722 


4 


us- 


■08- 


956- 


■171E-407 


Sequence 


407, App 


c 


38 


94 


10. 


2 


1722 


4 


us- 


■08- 


-781- 


-986A-407 


Sequence 


407, App 




39 


92.5 


io. 


0 


12566 


3 


us- 


•08- 


961- 


-527-149 


Sequence 


149, App 




40 


88.5 


9. 


6 


4550 


3 


us- 


■09- 


■462- 


■136-1 


Sequence 


1, Appli 




41 


88.5 


9. 


6 


4661 


4 


us- 


•09- 


■949- 


■016-4281 


Sequence 


4281, Ap 




42 


88.5 


9. 


6 


4673 


4 


us- 


-09- 


■814- 


■915A-100 


Sequence 


100, App 




43 


86.5 


9. 


4 


296 


4 


us- 


■09- 


■313- 


•294A-81 


Sequence 


81, Appl 




44 


86.5 


9. 


4 


1779 


4 


us- 


•09- 


134- 


-000C-2072 


Sequence 


2072, Ap 




45 


86 


9. 


3 


1302 


4 


us- 


■08- 


■956- 


■171E-322 


Sequence 


322, App 



ALIGNMENTS 



RESULT 1 

US-09-484-970B-106 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rarne_plus_p2n model 



Run on: 



Title: 

Perfect score: 
Sequence : 



April 16, 2005, 11:34:27 ; Search time 1670.87 Seconds 

(without alignments) 
4305.640 Million cell updates/sec 

US-09-830-972-2_COPY_975_1163 
925 

1 SWDLLYWRDIKKTGWFGA VKDAMAKIQAKIPGLKRKAD 189 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 

Searched: 34239544 seqs, 19032134700 residues 

Total number of hits satisfying chosen parameters: 



68479088 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094843_18360/app_query. fastaJL 
.3683 

-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT-0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THRJ5CORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MO DE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN-2 00 0 00 0 0 0 0 
-USER=US09830972_(§CGN_l_l_13747_@runat_14042005_094843_18360 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : 



EST:* 






1: 


gb 


estl : 




2: 


gb_ 


est2 : 




3: 


gb_ 


~htc:* 




4 


gb 


est3 : 




5 


gb 


est4 : 




6 


: gb 


est5 : 




7 


: gb 


est6 : 




8 


: gb 


gssl : 




9 


: gb 


gss2 : 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


925 


100 . 


o 


779 


7 


CO401465 


CO401465 


AG EN COURT 


2 


921 


99 . 


5 


695 


7 


CV077130 


CV077130 


AGENCOURT 


3 


921 


QQ 


5 


791 


7 


CF977898 


CF977898 


F2 6A06_04 


4 


913 


.7 o • 


7 


600 


9 


AY404972 


AY404972 


Mus muscu 


5 


913 




7 


679 


4 


BI149602 


BI149602 


602848410 


6 


913 


.7 o ■ 


7 


708 


4 


BI157842 


BI157842 


602923001 


7 


907 


■/O • 


1 


794 


1 


AU080127 


AU080127 


AU080127 


8 


907 




\ 


799 


1 


AU080133 


AU080133 


AU080133 


9 


904 


Q7 


7 


598 


7 


CF118424 


CF118424 


fs543. zl 


10 


904 


Q7 

_7 / • 


7 


600 


9 


AY404970 


AY404970 


Homo sapi 


11 


904 


Q7 

_7 / • 


7 


650 


6 


CB215381 


CB215381 


NISC np05 


12 


904 


Q7 


7 


667 


7 


CN429712 


CN429712 


170006000 


13 


904 


Q7 


7 


672 


7 


CK977984 


CK977984 


4109431 B 


14 


904 


Q7 


7 


682 


6 


CB162885 


CB162885 


K-EST0223 


15 


904 


Q7 


7 


712 


7 


CK971318 


CK971318 


4087182 B 


16 


904 


Q7 


7 


743 


6 


CD102817 


CD102817 


AGENCOURT 


17 


904 


Q7 

Z7 / * 


7 


758 


4 


BG697436 


BG697436 


602660623 


18 


904 


Q7 


7 


788 


1 


AL533461 


AL533461 


AL533461 


19 


904 


Q7 


7 


843 


4 


BG570231 


BG570231 


602590632 


20- 


904 


Q7 


7 


849 


7 


CR765672 


CR765672 


DKFZp469C 


c 21 


904 


Q7 


7 


875 


1 


AL573494 


AL573494 


AL573494 


22 


904 


Q7 


7 


1540 


3 


CR611869 


CR611869 


f ull-leng 


23 


904 


Q7 


7 


1785 


3 


AF077050 


AF077050 


Homo sapi 


24 


903 


Q7 

_7 / < 




781 


4 


BI079496 


BI079496 


602876306 


25 


901 


Q7 


. T 


730 


1 


AU297347 


AU297347 


AU297347 


26 


898 


Q7 

-7 / i 


1 

> X 


634 


6 


CB067821 


CB067821 


iq38a06.y 


27 


898 


Q7 

_7 / < 


1 

> X 


670 


7 


CV030029 


CV030029 


9024 Full 


28 


898 


Q7 

-7 / i 


1 

> X 


960 


7 


CN646472 


CN646472 


ILLUMIGEN 


29 


898 


Q7 


( 1 


983 


7 


CN803408 


CN803408 


ILLUMIGEN 


30 


898 


97 


. 1 


1031 


7 


CN647521 


CN647521 


ILLUMIGEN 


31 


898 


97 


. 1 


3533 


3 


AK034902 


AK034902 


Mus muscu 


32 


895 


96 


.8 


990 


4 


BI691132 


BI691132 


603314519 


33 


894 


96 


.6 


751 


7 


CK357937 


CK357937 


AGENCOURT 


34 


893 


96 


.5 


718 


1 


AA986233 


AA986233 


uc73gl2 .y 


35 


892 


96 


.4 


592 


7 


CN482802 


CN482802 


hw24hl2.y 


36 


892 


96 


.4 


805 


7 


C0735185 


C0735185 


SlLE04clO 


37 


892 


96 


.4 


958 


4 


BM801698 


BM801698 


AGENCOURT 


38 


891 


96 


.3 


749 


4 


BG296048 


BG296048 


602393712 


39 


891 


96 


.3 


757 


4 


BG715173 


BG715173 


602675631 


40 


890 


96 


.2 


924 


5 


BU845601 


BU845601 


AGENCOURT 


41 


890 


96 


.2 


1042 


7 


CN805577 


CN805577 


ILLUMIGEN 


42 


889 


96 


.1 


871 


6 


CD110203 


CD110203 


AGENCOURT 


43 


887 


95 


.9 


921 


5 . 


BU503291 


BU503291 


AGENCOURT 


44 


885 


95 


.7 


1013 


7 


CO048918 


CO048918 


ILLUMIGEN 


45 


885 


95 


.7 


1018 


7 


CN801888 


CN801888 


ILLUMIGEN 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 16, 2005, 22:42:49 



Search time 290.373 Seconds 

(without alignments) 

3948.747 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-2_COPY_975_1163 
925 

1 SWDLLYWRDIKKTGWFGA 



VKDAMAKIQAKI PGLKRKAD 189 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



5622541 seqs, 3033355566 residues 



Total number of hits satisfying chosen parameters: 



11245082 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094847_184 66/app_query. fasta_l 
.3683 

-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 . 
-TRANS=human4 0.cdi -LIST=45 -DOCALIGN-2 00 -THR_SCORE=pct -THR__MAX=100 
-THR_MIN=0 -ALIGN=15 - MO DE= LOCAL -OUTFMT=pto -NORM-ext -HEAPSIZE=500 -MINLEN=0 
-MAXLEN=2000000000 -USER=US09830972_@CGN_1_1_2 092_@runat_14 042 005__094 8 4 7_184 66 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq:* 

2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB. seq:* 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq: + 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:* 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq: * 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:* 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 

10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 



Database 



Published_Applications_NA: * 



11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 

12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq: * 

13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 

14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB. seq: * 

15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:* 

16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB. seq:* 

17 : /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB. seq: * 

18: /cgn2_6/ptodata/2/pubpna/US10F_PUBCOMB.seq:* 

19 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB. seq: * 

20: /cgn2_6/ptodata/2/pubpna/USll_NEW_PUB. seq: * 

21 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 

22 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB. seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


925 


100 


.0 


4684 


9 


US-09-893-348-17 


Sequence 17, Appl 


2 


925 


100 


.0 


4684 


18 


US-10-810-653-17 


Sequence 17, Appl 


3 


922 


99 


.7 


2782 


15 


US-10-205-194-165 


Sequence 165, App 


4 


908 


98 


.2 


3579 


9 


US-09-789-386-1 


Sequence 1, Appli 


5 


908 


98 


.2 


3579 


9 


US-09-893-348-22 


Sequence 22, Appl 


6 


908 


98 


.2 


3579 


17 


US-10-267-502-212 


Sequence 212, App 


7 


908 


98 


.2 


3579 


18 


US-10-327-213-8 


Sequence 8, Appli 


8 


908 


98,2 


3579 


18 


US-10-466-258-8 


Sequence 8, Appli 


9 


908 


98 


.2 


3579 


18 


US-10-810-653-22 


Sequence 22, Appl 


10 


908 


98 


.2 


4053 


9 


US-09-758-140-5 


Sequence 5, Appli 


11 


908 


98 


.2 


4053 


9 


US-0 9-972-5 9 9A-5 


Sequence 5, Appli 


12 


908 


98 


.2 


4053 


18 


US-10-717-597-310 


Sequence 310, App 


13 


908 


98 


.2 


4632 


14 


US-10-060-036-53 


Sequence 53, Appl 


14 


906.5 


98 


.0 


3492 


17 


US-10-267-502-214 . 


Sequence 214, App 


15 


904 


97 


.7 


799 


17 


US-10-660-946-2 


Sequence 2, Appli 


16 


904 


97 


.7 


1122 


9 


US-09-789-386-5 


Sequence 5, Appli 


17 


904 


97 


.7 


1160 


14 


US-10-175-523-156 


Sequence 156, App 


18 


904 


97 


.7 


1610 


9 


US-09-765-205-5 


Sequence 5, Appli 


19 


904 


97 


.7 


1785 


17 


US-10-439-388-62 


Sequence 62, Appl 


20 


904 


97 


.7 


2052 


18 


US-10-466-258-3 


Sequence 3, Appli 


21 


904 


97 


.7 


2235 


14 


US-10-060-036-54 


Sequence 54, Appl 


22 


898 


97 


.1 


1980 


17 


US-10-220-891-22 


Sequence 22, Appl 


23 


895 


96 


.8 


994 


11 


US-09-978-360A-110 


Sequence 110, App 


24 


891 


96 


.3 


2610 


17 


US-10-641-643-382 


Sequence 382, App 


25 


864 


93 


.4 


1798 


18 


US-10-466-258-10 


Sequence 10, Appl 


26 


844 


91 


.2 


1514 


9 


US-09-823-245A-349 


Sequence 349, App 


27 


679 


73 


.4 


1520 


15 


US-10-084-817-333 


Sequence 333, App 


28 


679 


73 


.4 


2331 


17 


US-10-267-502-213 


Sequence 213, App 


29 


679 


73 


.4 


3202 


9 


US-09-954-456-210 


Sequence 210, App 


30 


679 


73 


.4 


3202 


17 


US-10-172-118-386 


Sequence 386, App 


31 


679 


73 


.4 


3202 


17 


US-10-342-887-386 


Sequence 386, App 


32 


679 


73 


.4 


3202 


18 


US-10-723-860-1480 


Sequence 1480, Ap 


33 


679 


73 


.4 


3202 


19 


US-10-843-641A-3237 


Sequence 3237, Ap 


34 


679 


73 


.4 


3305 


18 


US-10-723-860-5926 


Sequence 5926, Ap 


35 


678 


73 


.3 


1502 


15 


US-10-205-219-94 


Sequence 94, Appl 



3d 


O / O 


1 O 


Q 




1 "7 
1 / 


ttc in oc7 cno oir 
Uz>- ±V-Zo f — DVZ-Z±D 


Sequence 215, App 


3 1 


o Id 


/ O 


U 




Q 

y 


TTC f\Q QCn ORO Qy|77 

ub-uy— y ou-oo^-ofi / / 


Sequence 8477, Ap 




ODD 


/ 1 


Q 

y 


1 /no 


Id 


ttc in ont^ 1Q/1 1 OQ 


Sequence 128, App 




DJ / 


/ 1 


U 


HZZ 


y 


ttc no Qtf;n qro iirc7 
ub- uy— you— jdz — lido / 


sequence hod/, a 


4 U 




O / 


o 


"711 


1 7 


TTC— 1 Pi — yl Pi ft 




41 


622.5 


67 


3 


1330 


15 


US-10-106-698-1945 


Sequence 1945, Ap 


42 


622.5 


67 


3 


1656 


9 


US-09-729-674-19 


Sequence 19, Appl 


43 


622.5 


67 


3 


1656 


18 


US-10-913-553-19 


Sequence 19, Appl 


44 


622.5 


67 


3 


1668 


9 


US-09-765-205-25 


Sequence 25, Appl 


45 


622.5 


67 


3 


1766 


10 


US-09-809-391-254 


Sequence 254, App 



ALIGNMENTS 

RESULT 1 

US-09-893-348-17 

Sequence 17, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
COHEN, I run R. 
BESERMAN, Pierre 
MOSONEGO, Alon 
MOALEM, Gila 

ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 



APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
TITLE OF INVENTION 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2 A 
CURRENT APPLICATION NUMBER: US/09/893,348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98 /14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 17 
LENGTH: 4 684 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: ( 253 ) . . ( 374 4 ) 
OTHER INFORMATION: 
US-09-893-348-17 



Alignment Scores: 

Pred. No. : 1.2e-101 

Score: 925.00 

Percent Similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 100.00% 

DB: 9 



Length: 4684 

Matches: 189 

Conservative: 0 

Mismatches: 0 

Indels: 0 

Gaps : 0 



I 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 11:29:46 ; Search time 1666.91 Seconds 

(without alignments) 
4183.459 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7 . 0 
Delop 6.0 , Delext 7.0 



Searched : 



4390206 seqs, 2959870667 residues 



8780412 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094842_18342/app_query. fasta_l 
.3683 

-DB=N_Geneseq -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 - TRANS =human 40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US09830972_@CGN_l_l_1955_@runat_14 042005_094842_18342 -NCPU=6 -ICPU=3 
-N0_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database 



N_Geneseq_16Dec04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3 : geneseqn2000s : * 

4: geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002as: * 

7 : geneseqn2002bs : * 

8 : geneseqn2003as : * 

9 : geneseqn2003bs : * 

10 : geneseqn2003cs : * 



11 : geneseqn2003ds : * 
12: geneseqn2004as : * 
13 : geneseqn2004bs : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



o. 


Score 


Match 


Length DB 


ID 


Description 


1 


5815 


98. 


2 


3919 


12 


ADP45550 


Adp45550 Human Nog 


2 


5815 


98. 


2 


4053 


4 


AAS09453 


Aas09453 Human cDN 


3 


5815 


98. 


2 


4053 


9 


ACC81048 


Acc81048 Human Nog 


4 


5815 


98. 


2 


4053 


12 


ADP13574 


Adpl3574 Renal eel 


5 


5815 


98 . 


2 


4632 


6 


ABV94680 


Abv94 680 Human pan 


6 


5815 


98. 


2 


4632 


10 


ADG32772 


Adg32772 Human DNA 


7 


5815 


98. 


2 


4789 


13 


ADR83534 


Adr83534 Human ret 


8 


5810 


98. 


1 


3579 


3 


AAZ56886 


Aaz56886 Human MAG 


9 


5810 


98. 


1 


3579 


4 


AAF90324 


Aaf90324 Human NOG 


10 


5810 


98. 


1 


3579 


6 


ABK90134 


Abk90134 DNA encod 


11 


5810 


98. 


1 


3579 


6 


ABN86601 


Abn86601 Human neu 


12 


5810 


98. 


1 


3579 


12 


ADO07886 


Ado07886 Human pol 


13 


5810 


98. 


1 


3579 


12 


ADR13965 


Adrl3965 Human NOG 


14 


5788 


97. 


7 


4093 


3 


AAA23454 


Aaa23454 cDNA enco 


15 


5780.5 


97. 


6 


4698 


8 


ABX34563 


Abx34 563 Human mdd 


16 


5748 


97. 


0 


4822^. 


.6. . 


ABS70449 " 


Abs70449 Human bon 


17 


4296. 5 


72. 


5 


3492 


12 


ADP45571 


Adp45571 Rat NogoA 


18 


4296.5 


72. 


5 


4684 


3 


AAD01173 


Aad01173 Rat neuri 


19 


4296. 5 


72. 


5 


4684 


6 


ABN86600 


Abn86600 Rat neuro 


20 


4277.5 


72. 


2 


3492 


12 


ADO07888 


Ado07888 Mouse pol 


21 


3798.5 


64. 


1 


3833 


3 


AAD01174 


Aad01174 Bovine ne 


22 


3306. 5 


55. 


8 


2386 


2 


AAV30920 


Aav30920 Human sec 


23 


3297.5 


55. 


7 


2386 


5 


AAF98399 


Aaf98399 Human cDN 


24 


2731 


46. 


1 


60615 


13 


ADT89536 


Adt89536 Mus muscu 


25 


2593 . 5 


43. 


8 


2425 


12 


AD026412 


Ado26412 Rat trunc 


26 


2511 . 5 


42. 


4 


1980 


4 


AAI 98079 


Aai98079 Human neu 


2 7 


2425.5 


41. 


0 


2248 


12 


7\ noo C A 1 1 

ADOZ b4 1 1 


Ado^Dfiii Kar. trunc 


28 


2425.5 


41. 


0 


2278 


12 


AD026413 


Ado26413 Rat trunc 


29 


1495.5 


25. 


2 


1122 


3 


AAZ56888 


Aaz56888 Human MAG 


30 


1495.5 


25. 


2 


1122 


4 


AAF90325 


Aaf90325 Human NOG 


31 


1495.5 


25. 


2 


1216 


6 


ABA05903 


Aba05903 Human RTN 


32 


1495.5 


25. 


2 


1610 


3 


AAZ36230 


Aaz36230 cDNA enco 


33 


1495.5 


25. 


2 


2235 


6 


ABV94681 


Abv94681 Human pan 


34 


1495.5 


25. 


2 


2240 


3 


AAC64406 


Aac64406 Human Nog 


35 


1491.5 


25. 


2 


1599 


10 


ADI62860 


Adi62860 Human apo 


36 


1487.5 


25. 


1 


2052 


6 


ABK90133 


Abk90133 DNA encod 


37 


1478.5 


25. 


0 


1694 


4 


AAK94408 


Aak94 4 08 Human ful 


38 


1478.5 


25. 


0 


1694 


12 


ADL31137 


Adl31137 Full leng 


39 


1437.5 


24. 


3 


2100 


8 


ABT42936 


Abt42936 Human neu 


40 


1417 


23. 


9 


868 


3 


AAZ56887 


Aaz56887 Human MAG 


41 


1340 


22. 


6 


1683 


4 


AAD08386 


Aad08386 Human sec 


42 


1225.5 


20. 


7 


2782 


10 


ADB85284 


Adb85284 Rat fooce 


43 


981 


16. 


6 


555 


12 


ADQ16423 


Adql6423 Nucleotid 


44 


981 


16. 


6 


2934 


12 


ADQ16433 


Adql6433 Construct 



c 45 932.5 15.7 1758 4 AAF32725 



Aaf32725 Human sec 



ALIGNMENTS 

RESULT 1 
ADP45550 

ID ADP45550 standard; cDNA; 3919 BP. 
XX 

AC ADP45550; 
XX 

DT 09-SEP-2004 (first entry) 
XX 

DE Human NogoA encoding cDNA SEQ ID NO: 4. 
XX 

KW binding molecule; human; NogoA; NiG; NiG-D20; NogoA_623-640 ; 

KW nerve repair; neuroprotective; gene therapy; 

KW central nervous system injury; CNS injury; neurodegenerative disorder; 

KW gene; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .3579 

FT /*tag= a 

FT /product^ "NogoA" 

XX 

PN WO2004052932-A2. 
XX 

PD 24-JUN-2004. 
XX 

PF 09-DEC-2003; 2003WO-EP013960 . 
XX 

PR 10-DEC-2002; ^2002GB-00028832 . 
XX 

PA (NOVS ) NOVARTIS AG. 

PA (NOVS ) NOVARTIS PHARMA GMBH. 

PA (UYZU-) UNIV ZUERICH. 

XX 

PI Barske C, Mir AK, Oertle T, Schnell L, Schwab ME, Vitaliti A; 

PI Zurini M; 

XX 

DR WPI; 2004-468818/44. 

DR P-PSDB; ADP45551. 
XX 

PT New binding molecule that binds to the human NogoA polypeptide, NiG, NiG- 

PT D20 or NogoA623-640, useful in preparing a composition for treating CNS 

PT injury or neurodegenerative disorders. 
XX 

PS Example 1; SEQ ID NO 4; 121pp; English. 
XX 

CC The present invention describes a binding molecule which binds to human 

CC NogoA polypeptide, human NiG, human N1G-D20 or human NogoA_623-640 with a 

CC dissociation constant of less than lOOOnM. Also described: (1) a 

CC polynucleotide encoding the binding molecule; (2) an expression vector or 

CC system comprising the polynucleotide; (3) a host cell comprising the 

CC expression system; (4) a pharmaceutical composition comprising the 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame__plus_p2n model 



Run on: April 16, 2005, 11:35:57 ; Search time 514.448 Seconds 

(without alignments) 
3746.799 Million cell updates/sec 

Title: US-09- 830-972-29 

Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKI PGLKRKAE 1178 



Scoring table: 

Searched: 
Total number o 



BLOSUM62 

Xgapop 10.0 , 

Ygapop 10.0 , 

Fgapop 6.0 , 

Delop 6.0 , 

1202784 seqs, 

hits satisfyi 



Xgapext 0.5 

Ygapext 0 . 5 

Fgapext 7.0 

Delext 7.0 

818138359 residues 

Lg chosen parameters: 2405568 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters : 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972/runat_14042005_094 844_18372/app_query. fasta_l 
.3683 

-DB=Issued_Patents__NA -QFMT=fastap -SUFFIX=rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MO DE= LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US09830972_@CGN_l_l_255_@runat_14042005_094844_18372 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database : Issued_Patents_NA: * 

1 : /cgn2_6/ptodata/l/ina/5A_COMB. seq: * 

2 : /cgn2_6/ptodata/l/ina/5B_COMB. seq: * 

3 : / cgn2_6/ptodata/ 1/ina/ 6A_COMB . seq : * 

4 : /cgn2_6/ptodata/l/ina/6B_COMB. seq: * 

5 : /cgn2_6/ptodata/l/ina/PCTUS_COMB. seq: * 

6: /cgn2_6/ptodata/l/ina/backf ilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 








Description 




1 


5748 


97. 


0 


4822 


3 


us- 


09- 


484- 


970B-106 


Sequence 


106, App 




2 


931 


15. 


7 


799 


2 


us- 


08- 


700- 


607-2 


Sequence 


2, Appli 




3 


931 


15. 


7 


1669 


4 


us- 


09- 


949- 


016-3253 


Sequence 


3253, Ap 




4 


918 


15. 


5 


2610 


4 


us- 


09- 


023- 


655-382 


Sequence 


382, App 




5 


800.5 


13. 


5 


3202 


4 


us- 


09- 


949- 


016-1127 


Sequence 


1127, Ap 




6 


750 


12. 


7 


3517 


4 


us- 


09- 


799- 


451-111 


Sequence 


111, App 




7 


727 . 5 


12. 


3 


2069 


4 


us- 


09- 


949- 


016-3309 


Sequence 


3309, Ap 




8 


707 


11. 


9 


428 


4 


us- 


09- 


513- 


999C-923 


Sequence 


923, App 




9 


635 


10. 


7 


1766 


3 


us- 


09- 


149- 


476-254 


Sequence 


254, App 




10 


635 


10. 


7 


2664 


3 


us- 


09- 


149- 


476-255 


Sequence 


255, App 




11 


627.5 


10. 


6 


2262 


4 


us- 


09- 


949- 


016-2988 


Sequence 


2988, Ap 




12 


576 


9. 


7 


382 


4 


us- 


09- 


513- 


999C-11526 


Sequence 


11526, A 




13 


527.5 


8. 


9 


1095 


2 


us- 


08- 


700- 


607-4 


Sequence 


4, Appli 




14 


526.5 


8. 


9 


2014 


4 


us- 


09- 


270- 


767-13561 


Sequence 


13561, A 




15 


517 


8. 


7 


794 


3 


us- 


09- 


149- 


476-102 


Sequence 


102, App 




16 


514 


8. 


7 


2181 


4 


us- 


•09- 


■949- 


016-1419 


Sequence 


1419, Ap 




17 


347 


5. 


9 


441 


4 


us- 


09- 


513- 


999C-2227 


Sequence 


2227, Ap 




18 


344.5 


5. 


8 


454 


4 


us- 


■09- 


621- 


976-740 


Sequence 


740, App 




19 


344.5 


5. 


8 


463 


4 


us- 


■09- 


■621- 


976-741 


Sequence 


741, App 


c 


20 


341 


5. 


8 


601 


4 


us- 


■09- 


■949- 


016-117588 


Sequence 


117588, 


c 


21 


341 


5. 


8 


601 


4 


us- 


■09- 


■949- 


016-117589 


Sequence 


117589, 




22 


341 


5. 


8 


42075 


4 


us- 


■09- 


■949- 


016-14995 


Sequence 


14995, A 




23 


305 


5. 


1 


7571 


4 


us- 


■09- 


•949- 


■016-4366 


Sequence 


4366, Ap 




24 


305 


5. 


1 


8146 


4 


us- 


■09- 


•976- 


594-725 


Sequence 


725, App 




25 


305 


5. 


1 


11917 


4 


us- 


■09- 


•566- 


921-32 


Sequence 


32, Appl 




26 


305 


5. 


.1 


101300 


4 


us- 


■09- 


•949- 


■016-16108 


Sequence 


16108, A 




27 


301 


5. 


, 1 


145287 


4 


us- 


■09- 


-949- 


016-13530 


Sequence 


13530, A 




28 


301 


5. 


, 1 


145287 


4 


us- 


■09- 


-949- 


■016-13531 


Sequence 


13531, A 




29 


294 


5. 


.0 


443 


4 


us- 


-09- 


•513- 


•999C-3784 


Sequence 


3784, Ap 




30 


288 


4 . 


.9 


8560 


4 


us- 


■09- 


-949- 


■016-1788 


Sequence 


1788, Ap 




31 


288 


4. 


,9 


8560 


4 


us- 


-09- 


-949- 


•016-1789 


Sequence 


1789, Ap 




32 


280 


4. 


,7 


135667 


4 


us- 


-09- 


-949- 


■016-15051 


Sequence 


15051, A 




33 


280 


4. 


.1 


152486 


4 


us- 


-09- 


-949- 


■016-12869 


Sequence 


12869, A 


c 


34 


277 


4, 


.7 


601 


4 


us- 


■09- 


-949- 


■016-40169 


Sequence 


40169, A 


c 


35 


277 


4. 


.7 


601 


4 


us- 


-09- 


-949- 


•016-119335 


Sequence 


119335, 




36 


276. 5 


4, 


.7 


5361 


3 


us- 


-08- 


-973- 


-462-2 


Sequence 


2, Appli 




37 


276.5 


4 , 


,7 


6152 


3 


us- 


-08- 


-973- 


-462-1 


Sequence 


1, Appli 




38 


276 


4 . 


.7 


261 


2 


us- 


■08- 


-700- 


-607-9 


Sequence 


9, Appli 




39 


271 


4. 


.6 


13906 


4 


us- 


-09- 


-949- 


-016-14730 


Sequence 


14730, A 




40 


265 


4, 


.5 


5200 


4 


us- 


■08- 


-978- 


-277A-3 


Sequence 


3, Appli 




41 


264.5 


4. 


.5 


8224 


6 


5180808-1 




Patent No, 


. 5180808 




42 


264.5 


4, 


.5 


8224 


6 


5180808-1 




Patent No, 


. 5180808 




43 


261 


4, 


.4 


14770 


4 


us- 


-09- 


-949- 


-016-1107 


Sequence 


1107, Ap 




44 


258 


4, 


.4 


13187 


4 


us- 


-09- 


-949- 


-016-5062 


Sequence 


5062, Ap 




45 


257.5 


4 


.3 


6608 


4 


us- 


-09- 


-220- 


-132-58 


Sequence 


58, Appl 



ALIGNMENTS 



RESULT 1 

US-09-484-970B-106 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: April 16, 2005, 22:42:49 ; Search time 1809.84 Seconds 

(without alignments) 
3948.747 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKI PGLKRKAE 1178 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7 . 0 
Delop 6.0 , Delext 7.0 

5622541 seqs, 3033355566 residues 



Total number of hits satisfying chosen parameters: 



11245082 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool_h/US09830972./runat_14042005_094847_184 66/app_query. fastaJL 
.3683 

-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 
-TRANS=human40 . cdi -LIST=45 -DOCALIGN=200 -THR__SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -M0DE=L0CAL -OUTFMT^pto -NORM=ext -HEAPSI ZE=500 -MINLEN=0 
-MAXLEN=2 000000000 -USER=US09830972_@CGN_l_l_2092_@runat_14 042005__094 847_184 66 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=12 0 -WARN_TIMEOUT=30 -THREADS-1 -XGAPOP=10 -XGAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : Published_Applications_NA: * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB . seq : * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq: * 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:* 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB. seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:* 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 

10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 



1 1 
1 J. 


/ cgn^ 


1Z 


/ cgnz 




/ — O 

/ cgn^ 


1 /I 


/ cgn2 


1 R 
±0 


/ cgnz 


J. D 


/ cgn^ 


17 


/ cgn2 


18 


/ cgn2 


19 


: / cgn2 


20 


: /cgn2 


21 


: /cgn2 


22 


: /cgn2 



_6/ptodata/2/pubpna/US 
_6/ptodata/2/pubpna/US 
_6/ptodata/2/pubpna/US 
_6/ptodata/2/pubpna/US 

6/ptodata/2/pubpna/US 
~6/ptodata/2/pubpna/US 
_6/ptodata/2/pubpna/US 

6/ptodata/2/pubpna/US 

6/ptodata/2/pubpna/US 
~6/ptodata/2/pubpna/US 

6/ptodata/2/pubpna/US 
~6/ptodata/2/pubpna/US 



09C_PUBCOMB.seq: 
09_NEW_PUB.seq:* 
10A_PUBCOMB. seq: 
10B_PUBCOMB. seq: 
10C_PUBCOMB. seq: 
10D_PUBCOMB.seq: 
10E_PUBCOMB. seq: 
10F_PUBCOMB.seq: 
10_NEW_PUB.seq: * 
1 1_NEW_PUB . seq : * 
60_NEW_PUB.seq:* 
60 PUBCOMB.seq:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 


5815 


98 . 


2 


4053 


9 


US -09-758-14 0-5 


Sequence 5, Appli 


2 


5815 


98 . 


2 


4053 


9 


US-09-972-599A-5 


Sequence 5, Appli 


3 


5815 


98 . 


2 


4053 


18 


ttn -i r\ ITT cm oin 

US-10-71 /-by /-31U 


Sequence 310, App 


4 


5815 


98 . 


2 


4632 


14 


US-IO-060-Ood-oo 


Sequence 53, Appl 


5 


5810 


98 . 


1 


3579 


9 


US- 09- /oy-ioD-1 


Sequence 1/ Appli 


6 


5810 


98 . 


1 


3579 


9 


US- 09-890-34 o-ZZ 


oequence zz r /vppj. 


7 


5810 


98 . 


1 


3579 


17 


US-lu— Zo i -oUZ-Z 1Z 


bequence <£±z, >vpp 


8 


5810 


98. 


1 


3579 


18 


US-10-327-213-8 


Sequence 8, Appli 


9 


5810 


98. 


1 


3579 


18 


US-10-466-258-8 


Sequence 8, Appli 


10 


5810 


98. 


1 


3579 


18 


US-10-810-653-22 


Sequence 22, Appl 


11 


4296.5 


72. 


5 


4684 


9 


US-09-893-348-17 


Sequence 17, Appl 


12 


4296.5 


72. 


5 


4684 


18 


US-10-810-653-17 


Sequence 17, Appl 


13 


4277.5 


72. 


2 


3492 


17 


US-10-267-502-214 


Sequence 214, App 


14 


2731 


46. 


1 


60615 


18 


US-10-633-423-9 


Sequence 9, Appli 


15 


2731 


46. 


1 


60615 


18 


US-10-427-741-9 


Sequence 9, Appli 


16 


2511.5 


42. 


4 


1980 


17 


US-10-220-891-22 


Sequence 22, Appl 


17 


1495.5 


25. 


2 


1122 


9 


US-09-789-386-5 


Sequence 5, Appli 


18 


1495.5 


25. 


2 


1610 


9 


US-09-765-205-5 


Sequence 5, Appli 


19 


1495.5 


25. 


2 


2235 


14 


US-10-060-036-54 


Sequence 54, Appl 


20 


1487.5 


25. 


1 


2052 


18 


US-10-466-258-3 


Sequence 3, Appli 


21 


1437.5 


24. 


3 


2100 


19 


US-10-479-081-217 


Sequence 217, App 


22 


1417 


23. 


9 


868 


9 


US-09-789-386-3 


Sequence 3, Appli 


23 


1225.5 


20. 


7 


2782 


15 


US-10-205-194-165 


Sequence 165, App 


24 


931 


15. 


7 


799 


17 


US-10-660-946-2 


Sequence 2, Appli 


25 


931 


15. 


7 


1160 


14 


US-10-175-523-156 


Sequence 156, App 


26 


931 


15. 


7 


1785 


17 


US-10-439-388-62 


Sequence 62 , Appl 


27 


922 


15. 


6 


994 


11 


US-09-978-360A-110 


Sequence 110, App 


28 


918 


15. 


5 


2610 


17 


US-10-641-643-382 


Sequence 382 , App 


29 


890 


15. 


0 


1798 


18 


US-10-466-258-10 


Sequence 10, Appl 


30 


867 


14. 


6 


1514 


9 


US-09-823-245A-349 


Sequence 349, App 


31 


800.5 


13. 


5 


3202 


9 


US-09-954-456-210 


Sequence 210, App 


32 


800.5 


13. 


5 


3202 


17 


US-10-172-118-386 


Sequence 386, App 


33 


800.5 


13. 


5 


3202 


17 


US-10-342-887-386 


Sequence 386, App 


34 


800.5 


13. 


5 


3202 


18 


US-10-723-860-1480 


Sequence 1480, Ap 


35 


800.5 


13, 


5 


3202 


19 


US-10-843-641A-3237 


Sequence 3237, Ap 



c 42 
43 

c 44 
45 



36 
37 
38 
39 
40 
41 



707.5 
707.5 
707.5 
707.5 



800.5 
789.5 
781.5 



750 
709 
709 



13.5 
13.3 
13.2 
12.7 
12.0 
12.0 
11.9 
11.9 
11.9 
11.9 



3305 
2331 
2343 
3517 
3637 
3637 
901 
901 
901 
901 



18 
17 
17 
17 
17 
17 
18 
18 
19 
19 



US 
US 
US 
US 
US 

us 
us 
us 
us 
us 



10-723 
10-267 
10-267 
10-302 
10-108 
10-159 
10-363 
10-363 
10-363 
10-363 



860-5926 

502-213 

502-215 

172-111 

260A-449 

563-443 



345A-20065 
345A-20066 
483A-20065 
483A-20066 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



5926, Ap 
213, App 
215, App 
111, App 
449, App 
443, App 

20065, A 

20066, A 

20065, A 

20066, A 



ALIGNMENTS 



RESULT 1 
US-09-758-140-5 

; Sequence 5, Application US/09758140 
; Patent No. US20020012965A1 
; GENERAL INFORMATION: 

; APPLICANT: Strittmatter , Stephen M. 

; TITLE OF INVENTION: No. US20020012 965Alo Receptor-Mediated Blockade of 
Axonal Growth 

; FILE REFERENCE: 44574-5073-US 

; CURRENT APPLICATION NUMBER: US/09/758,140 

; CURRENT FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: US 60/175,707 

; PRIOR FILING DATE: 2000-01-12 

; PRIOR APPLICATION NUMBER: US 60/207,366 

PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: US 60/236,378 
; PRIOR FILING DATE: 2000-09-29 
; NUMBER OF SEQ ID NOS : 20 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 5 

LENGTH: 4 053 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE : 

NAME /KEY: CDS 

LOCATION: ( 135 )..( 3710 ) 

OTHER INFORMATION: Human mRNA for No. US20020012965Alo protein (KIAA0886, 
GenBank 

OTHER INFORMATION: Accession No. US20020012965A1 AB020693) 
US-09-758-140-5 



Alignment Scores: 



Pred. No. : 


0 


Length: 


4053 


Score: 


5815.00 


Matches : 


1160 


Percent Similarity: 


97.57% 


Conservative : 


4 


Best Local Similarity: 


97.23% 


Mismatches : 


13 


Query Match: 


98.18% 


Indels : 


16 


DB: 


9 


Gaps : 


3 



US-09-830-972-29 (1-1178) x US-09-758-140-5 (1-4053) 



Qy 



1 MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 20 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

April 16, 2005, 11:30:21 ; Search time 14400.4 Seconds 

(without alignments) 
3963.787 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKI PGLKRKAE 1178 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7.0 
Delop 6.0 , Delext 7.0 

4708233 seqs, 24227607955 residues 

Total number of hits satisfying chosen parameters: 9416466 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool__h/US09830972/runat_14042005_094 842__1834 8/app_query. fasta_l 
.3683 

-DB=GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE- LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN-2 0 00 0000 00 - 
-USER=US09830972_@CGN_l_l_19954_@runat_14 042005_094842_18348 -NCPU=6 -ICPU-3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 - THREADS =1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP- 6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT-7 



GenEmbl : * 


1: 


gb_ba : * 


2 


gb_htg : 


3 


gb_in : * 


4 


gb_om: * 


5 


gb_o v : * 


6 


: gb_pat: 


7 


: gb_ph : * 


8 


: gb_pl : * 


9 


: gb_pr:* 


10: gb_ro: 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



11: gb_sts:* 
12: gb_sy:* 
13: gb_un:* 
14: gb_vi:* 

Pred. Nol is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



ro. 


Score 


Match 


Length 


DB 


ID 


Description 


l 


5815 


98.2 


3919 


6 


CQ829486 


CQ829486 Sequence 


2 


5815 


98.2 


4053 


6 


AX195249 


AX195249 Sequence 


3 


5815 


98.2 


4053 


9 


AB020693 


AB020693 Homo sapi 


4 


5815 


98.2 


4166 


9 


AB040462 


AB040462 Homo sapi 


5 


5815 


98.2 


4632 


9 


AF148537 


AF148537 Homo sapi 


6 


5815 


98.2 


4789 


6 


CQ874017 


CQ874017 Sequence 


.7 


5815 


98.2 


4789 


9 


AY102279 


AY102279 Homo sapi 


8 


5810 


98.1 


3576 


6 


AX766050 


AX766050 Sequence 


9 


5810 


98.1 


3579 


6 


BD249446 


BD249446 Protein s 


10 


5810 


98.1 


3579 


9 


HSA251383 


AJ251383 Homo sapi 


11 


5788 


97.7 


4093 


6 


BD270070 


BD270070 Secreted 


12 


5748 


97.0 


4822 


6 


AR220865 


AR220865 Sequence 


13 


4876 


82.3 


4102 


9 


AY123245 


AY123245 Homo sapi 


14 


4866 


82.2 


4123 


9 


AY123247 


AY123247 Homo sapi 


15 


4864 


82.1 


4160 


9 


AY123246 


AY123246 Homo sapi 


16 


4863 


82.1 


4109 


9 


AY123248 


AY123248 Homo sapi 


17 


4858 


82.0 


4060 


9 


AY123250 


AY123250 Homo sapi 


18 


4858 


82.0 


4070 


9 


AY123249 


AY123249 Homo sapi 


19 


4856 


82.0 


3491 


9 


AF333336 


AF333336 Homo sapi 


20 


4626 


78.1 


2974 


6 


CQ716296 


CQ716296 Sequence 


21 


4625 


78.1 


2883 


9 


AF320999 


AF320999 Homo sapi 


22 


4296.5 


72.5 


3489 


6 


AX766046 


AX766046 Sequence 


23 


4296.5 


72.5 


3492 


6 


CQ829507 


CQ829507 Sequence 


24 


4296.5 


72.5 


4684 


10 


RN0242961 


AJ242961 Rattus no 


25 


4289 


72.4 


4627 


10 


AY102284 


AY102284 Mus muscu 


26 


4280 


72.3 


4518 


10 


BC056373 


BC056373 Mus muscu 


27 


4277.5 


72.2 


3821 


10 


AY114152 


AY114152 Mus muscu 


28 


3938 


66.5 


39674 


9 


AC092461 


AC092461 Homo sapi 


29 


3938 


66.5 


90756 


9 


AY102285 


AY102285 Homo sapi 


30 


3938 


66.5 


162692 


2 


AC016171 


AC016171 Homo sapi 


31 


3658.5 


61.8 


4063 


10 


AY102280 


AY102280 Mus muscu 


32 


3561 


60.1 


3815 


10 


BC032272 


BC032272 Mus muscu 


33 


3441 


58.1 


2481 


9 


AF063601 


AF063601 Homo sapi 


34 


3306.5 


55.8 


2386 


6 


BD190738 


BD190738 Secreted 


35 


3306.5 


55.8 


2386 


6 


AX099401 


AX099401 Sequence 


36 


2731 


46.1 


60615 


10 


AY102286 


AY102286 Mus muscu 


37 


2731 


46.1 


166516 


2 


AC135510 


AC135510 Mus muscu 


38 


2731 


46.1 


211357 


2 


AC113284 


AC113284 Mus muscu 


39 


2731 


46.1 


212042 


10 


AL929371 


AL929371 Mouse DNA 


40 


2711.5 


45.8 


218532 


2 


AC131431 


AC131431 Rattus no 


41 


2711.5 


45.8 


238341 


2 


AC133315 


AC133315 Rattus no 


42 


2661 


44.9 


2958 


10 


BC032192 


BC032192 Mus muscu 


43 


2593.5 


43.8 


2425 


6 


CQ814527 


CQ814527 Sequence 



44 2511.5 42.4 1980 6 BD083733 BD083733 Nucleic a 

45 2511.5 42.4 1980 6 BD097380 BD097380 Nucleic a 



ALIGNMENTS 

RESULT 1 
CQ829486 

LOCUS CQ829486 3919 bp DNA linear PAT 05-JUL-2004 

DEFINITION Sequence 4 from Patent WO2004052932 . 

ACCESSION CQ829486 

VERSION CQ829486.1 GI: 49732808 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Barske,C, Mir,A.K., Oertle,T., Schnell,L., Schwab, M.E., 

Vitaliti,A. and Zurini,M. 
TITLE Nogo a binding molecules and pharmaceutical use thereof 

JOURNAL Patent: WO 2004052932-A 4 24-JUN-2004; 
Novartis AG (CH) 
FEATURES Location/Qualifiers 
source .1. .3919 

/organism="Homo sapiens" 
/mol_type="unas signed DNA 11 
/db_xref =" taxon : 9606" 
CDS 1. .3579 

/note="unnamed protein product; Human NogoA" 
/ codon_start=l 
/protein_id="CAH03193.1" 
/db_xref="GI : 49732809" 

/ trans lation-"MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED 

EDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP 

ERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWT 

PPAPAPAAPPSTPAAPKRRGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTIS 

AGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEV 

SEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKL 

VSNNILHNQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFER 

VWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPST 

PEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEK 

NTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV 

TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV 

PSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPEN 

INAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSP 

DSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKP 

YLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET 

ETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 

LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKV 

LVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLFLLLS 

LTVFS I VSVTAYI ALALLS VT I S FRI YKGVIQAIQKS DEGHP FRAYLES EVAI S EELV 

QKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVIJVrW 

SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE " 

ORIGIN 



Alignment Scores: 



Qy 946 ThrGlnAlaGluIleGluSerlleValLysProLysValLeuGluLysGluAlaGluLys 965 

I I I I I I I I I I I I I I 11 I I I I I ! I 1 I I I I I I It I I I I I I I I I I I I M II I I I I I I I I I 
Db 3019 ACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAAGAAGCTGAGAAA 3078 

Qy 966 LysLeuProSerAspThrGluLysGluAspArgSerProSerAlallePheSerAlaAsp 985 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I :: : 
Db 3079 AAACTT CCTT CC GAT AC AGAAAAAGAGGACAGAT C AC CAT CT GCT AT ATT T T CAGCAGAG 3138 

Qy 986 LeuGlyLysThrSerValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyVal 1005 

III I I I I I I I I I I I I I I II I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3139 CTGAGTAAAACTTCAGTTGTT GACCTCCTGTACTGGAGAGACATTAAGAAGACT GGAGTG 3198 

Qy 1006 ValPheGlyAlaSerLeuPheLeuLeuLeuSerLeuThrValPheSerlleValSerVal 1025 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3199 GTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTA 3258 

Qy 1026 ThrAlaTyrlleAlaLeuAlaLeuLeuSerValThrlleSerPheArglleTyrLysGly 1045 

I I I I I I I I I I I II I II I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 3259 ACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGT 3318 

Qy 1046 VallleGlnAlalleGlnLysSerAspGluGlyHisProPheArgAlaTyrLeuGluSer 1065 

I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3319 GT GAT C CAAGCT AT C CAGAAAT CAGAT GAAGGC CAC CCATT C AGGGCAT AT CT GGAAT CT 3378 

Qy 1066 GluValAlalleSerGluGluLeuValGlnLysTyrSerAsnSerAlaLeuGlyHisVal 1085 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I r 
Db 3379 GAAGT T GCT AT AT CT GAGGAGTT GGTT C AGAAGT ACAGTAAT T CT GCT CTT GGT C ATGT G 3438 

Qy 1086 AsnCysThrlleLysGluLeuArgArgLeuPheLeuValAspAspLeuValAspSerLeu 1105 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 3439 AACT GCAC GAT AAAGGAACT C AGGCGCCT CTT CTT AGTT GAT GAT TT AGT T GAT T CTCTG 3498 

"Qy 1106 LysPheAlaValLeuMetTrpValPheThrTyrValGlyAlaLeuPheAsnGlyLeuThr 1125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 
Db 3499 AAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACA 3558 

Qy 1126 LeuLeuIleLeuAlaLeuIleSerLeuPheSerValProVallleTyrGluArgHisGln 1145 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 3559 CTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 3618 

Qy ■ 1146 AlaGlnlleAspHisTyrLeuGlyLeuAlaAsnLysAsnValLysAspAlaMetAlaLys 1165 

I I I I I I I I II I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3619 GCACAGATAGATCATTATCTAGGACTT GCAAATAAGAATGTTAAAGAT GCTATGGCTAAA 3678 

Qy 1166 IleGlnAlaLysIleProGlyLeuLysArgLysAlaGlu 1178 

II I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 3679 ATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAA 3717 



RESULT 5 
U5^fl3=- 785^38 6-1 

/(^Sequence i /^Application US/09789386 
; Pa^B4^<^tj S 2 0 0 2 0 0 1 0 3 2 4 A 1 

; GENERAL INFORJJ&KOi^ 

; APPLICAN < p<MICHALOVICH f DAVIDj) 

; TITLE OF INVENTION: NOVEL COMPOUNDS 
; FILE REFERENCE: GP-30165-C1 



; CURRENT APPLICATION NUMBER: US/09/789, 386 - 

; CURRENT FILING DATE: 20(31-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 

; PRIOR FILING DATE: 1999-07-22 

; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 1 

LENGTH: 3579 

TYPE: DNA 
; ORGANISM: HOMO SAPIENS 
US-09-789-386-1 



Alignment Scores : 



Pred. No. : 


0 


Length : 


3579 


Score : 


5810.00 


Matches : 


1159 


Percent Similarity: 


97.49% 


Conservative : 


4 


Best Local Similarity: 


97.15% 


Mismatches : 


14 


Query Match: 


98.09% 


Indels : 


16 


DB: 


9 


Gaps : 


3 



US- 


09-830-972- 


-29 (1-1178) x US-09-789-386-1 (1-3579) 




Qy 


1 


MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 


20 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I I 1 1 I M I I I I I I | | | 




Db 


1 


ATGGAAGACCTGGACCAGTCTCCTCTGGTCTCGTCCTCGGACAGCCCACCCCGGCCGCAG 


60 


Qy 


21 


ProAlaPheLysTyrGlnPheValArgGluProGluAspGluGluGluGluGluGluGlu 


40 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I || I I I | | 




Db 


61 


C C CG CGT TCAAGT ACC AGT T C GT GAGGGAGCCCGAGGACGAGGAGGAAGAAGAGGAGGAG 


120 


Qy 


41 


GluGluGluAspGluAspGluAspLeuGluGluLeuGluValLeuGluArgLysProAla 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


GAAGAGGAGGAC GAGGAC GAAGAC CT GGAGGAGCT GGAGGTGCT GGAGAGGAAGC CC GC C 


180 


Qy 


61 


AlaGlyLeuSerAlaAlaProValProThrAlaProAlaAlaGlyAlaProLeuMetAsp 


80 






1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 ! 1 1 1 i 1 1 1 1 1 1 




Db 


181 


GCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTGATGGAC 


240 


Qy 


81 


PheGlyAsnAspPheValProProAlaProArgGlyProLeuProAlaAlaProProVal 


100 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I I 




Db 


241 


TTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCCCCCGTC 


300 


Qy 


101 


AlaProGluArgGlnProSerTrpAspProSerProValSerSerThrValProAlaPro 


120 






1 1 1 M 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I 1 1 1 1 




Db 


301 


GCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCCGCGCCA 


360 


Qy 


121 


SerProLeuSerAlaAlaAlaValSerProSerLysLeuProGluAspAspGluProPro 


140 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


TCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAGCCTCCG 


420 


Qy 


141 


AlaArgProProProProProProAlaSerValSerProGlnAlaGluProValTrpThr 


160 






1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTGTGGACC 


480 



Qy 161 ProProAlaProAlaProAlaAlaProProSerThrProAlaAlaProLysArgArgGly 180 

I I I I I I I I ! I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I M I I I I I II I I I I I M I 

Db 481 CCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGCAGGGGC 540 

Qy 181 SerSerGlyAla — ValVal 186 

MINIMI::: |||::: 

Db 541 TCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCTGTGATA 600 

Qy 187 ************LysIleMetAspLeuLysGluGlnProGlyAsnThrIleSerAlaGly 206 

:: : I II I I I I I I M I I I I I I I I I I I I I I I I I M M I I I I I I I I I I 

Db 601 CGCTCCTCTGCAGAAAATATGGACTTGAAGGAGCAGCCAGGTAACACTATTTCGGCTGGT 660 

Qy 207 GlnGluAspPheProSerValLeuLeuGluThrAlaAlaSer+**ProSerLeuSerPro 226 

I I I I I I I I I I I I I I 11 I II I I II I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I 

Db 661 CAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCTCTGTCTCCT 720 

Qy 227 LeuSerAlaAlaSerPheLysGluHisGluTyrLeuGlyAsnLeuSerThrValLeuPro 24 6 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 CT CTCAGC C GCTT CTT T CAAAGAACAT GAAT ACCTTGGT AATT T GT CAACAGT ATT ACCC 780 

Qy 247 ThrGluGlyThrLeuGlnGluAsnValSerGluAlaSisrLysGluValSerGluLysAla 266 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 ACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCAGAGAAGGCA 840 

Qy 267 LysThrLeuLeuIleAspArgAspLeuThrGluPheSerGluLeuGluTyrSerGluMet 286. 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 AAAACT CT ACT CAT AGATAGAGATTT AACAGAGTTTT CAGAATT AGAAT ACTCAGAAAT G 900 

Qy 287 GlySerSerPheSerValSerProLysAlaGluSerAlaVallleValAlaAsnProArg 306 

I I II I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 901 GGAT CAT C GTT C AGT GT CT CT C CAAAAGC AGAAT CTGCCGTAAT AGT AGCAAAT CCT AGG 960 

Qy 307 GluGluIlelleValLysAsnLysAspGluGluGluLysLeuValSerAsnAsnlleLeu 326 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GAAGAAATAATCGT GAAAAATAAAGAT GAAGAAGAGAAGTTAGTTAGTAATAACAT CCTT 1020 

Qy 327 His***GlnGlnGluLeuProThrAlaLeuThrLysLeuValLysGluAspGluValVal 34 6 

III I I I I I II I I I I I II I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I II I I I I I 

Db 1021 CATAATCAACAAGAGTTACCTACAGCTCTTACTAAATT GGTTAAAGAGGATGAAGTT GTG 1080 

Qy 347 SerSerGluLysAlaLys7VspSerPheAsnGluLysArgValAlaValGluAlaProMet 366 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I III I I I I I I I I I II I I II I I I I I I I I I 

Db 1081 T CTTCAGAAAAAGCAAAAGACAGTTTTAAT GAAAAGAGAGTT GCAGTGGAAGCT CCTATG 114 0 

Qy 367 ArgGluGluTyrAlaAspPheLysProPheGluArgValTrpGluValLysAspSerLys 386 

I I I M I II I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I II I II I I I I I I I I I I I I 

Db 1141 AGGGAGGAAT AT GCAGACT T CAAAC CAT T T GAGC GAGT AT G GGAAGT GAAAGAT AGTAAG 1200 

Qy 387 GluAspSerAspMetLeuAlaAlaGlyGlyLysIleGluSerAsnLeuGluSerLysVal 406 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 1201 GAAGATAGTGATAT GTTGGCT GCTGGAGGTAAAATCGAGAGCAACTT GGAAAGTAAAGTG 1260 

Qy 407 AspLysLysCysPheAlaAspSerLeuGluGlnThrAsnHisGluLysAspSerGluSer 426 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I II I I I I I I I 

Db 1261 GATAAAAAAT GTT T T GC AGAT AGCCTTGAGCAAACTAAT CACGAAAAAGAT AGT GAGAGT 1320 



Qy 427 SerAsnAspAspThrSerPheProSerThrProGluGlylleLysAspArgSerGlyAla 446 

I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 1321 AGTAAT GAT GAT ACT T CTTT C CC C AGT ACGCCAGAAGGT ATAAAGG AT CGT CC AGGAGCA 1380 

Qy 447 TyrlleThrCysAlaProPheAsnProAlaAlaThrGluSerlleAlaThrAsnllePhe 466 

I I I I I I I I I I I I M I I II I I I I I I I II I I I I I I I I I I I I I I II I I II I I I II I I I II I I I 

Db 1381 T AT ATCACATGT GCT CCCTTT AAC G C AGCAGCAACT GAGAGCAT T GCAACAAACAT T T T T 1440 

Qy 467 ProLeuLeuGluAspProThrSerGluAsn***ThrAspGluLysLysIleGluGluLys 4 86 

I I I I I I I I I I I I i I I I I I I I I I I II II I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 1441 CCTTTGTT AGGAGAT C CTACT T C AGAAAAT AAGACCGAT GAAAAAAAAAT AGAAGAAAAG 1500 

Qy 487 LysAlaGlnlleValThrGluLysAsnThrSerThrLysThrSerAsnProPhePheVal 506 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I III 

Db 1501 AAGGCC CAAAT AGT AACAGAGAAGAAT ACTAGCACCAAAAC AT CAAAC CCTT TT CTT GTA 1560 

Qy 507 AlaAlaGlnAspSerGluThrAspTyrValThrThrAspAsnLeuThrLysValThrGlu 526 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I II I I II I II I I I I I I I I I I I II I 

Db 1561 GCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACAAAGGTGACTGAG 1620 

Qy 527 GluValValAlaAsnMetProGluGlyLeuThrProAspLeuValGlnGluAlaCysGlu 546 

I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I 

Db 1621 GAAGT C GT GGCAAACAT GCCT GAAGGCCT GACT CCAGATT T AGT ACAGGAAGCAT GT GAA 1680 

Qy 547 SerGluLeuAsnGluValThrGlyThrLysIleAlaTyrGluThrLysMetAspLeuVal 566 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 1681 AGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAAATGGACTTGGTT 1740 

Qy 567 GlnThrSerGluValMetGlnGluSerLeuTyrProAlaAlaGlnLeuCysProSerPhe 586 

I I I I | I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 1741 CAAACAT C AGAAGT T AT GCAAGAGT C ACTCT AT C CT GC AGCACAGCTTT GC C CAT C AT TT 1800 

Qy 587 GluGluSerGluAlaThrProSerProValLeuProAspIleValMetGluAlaProLeu 606 

I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 1801 GAAGAGT C AGAAGCT ACT CCTT C AC CAGTT TT GCCT GAC ATT GT TAT GGAAGC ACC AT T G 1860 

Qy 607 AsnSerAlaValProSerAlaGlyAlaSerVallleGlnProSerSerSerProLeuGlu 626 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1861 AAT T CT GCAGT T C CT AGT GCT GGT GCTT CC GT GAT AC AGCCCAGCT CAT CACC ATT AGAA 1920 

Qy ' 627 AlaSerSerValAsnTyrGluSerlleLysHisGluProGluAsnProProProTyrGlu 646 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1921 GCTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAACCCCCCACCATATGAA 1980 

Qy 647 GluAlaMetSerValSerLeu- — LysValSerGlylleLysGluGluIleLysGluPro 665 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 1981 GAGGC CAT GAGT GT AT CACT AAAAAAAGT AT C AGGAATAAAGGAAGAAAT TAAAGAG C CT 204 0 

Qy 666 GluAsnlleAsnAlaAlaLeuGlnGluThrGluAlaProTyrlleSerlleAlaCysAsp 685 

" I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2041 GAAAATATT AATGCAGCT CTT CAAGAAACAGAAGCTCCTT ATATAT CTATTGCAT GT GAT 2100 

Qy 686 LeuIleLysGluThrLysLeuSerAlaGluProAlaProAspPheSerAspTyrSerGlu 705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2101 TTAAT TAAAGAAACAAAGCT T T CT GCT GAAC C AGCT CCGGATTT CT CT GAT TATT CAGAA 2160 

Qy 706 MetAlaLysValGluGlnProValProAspHisSerGluLeuValGluAspSerSerPro 725 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II LI I I I I 

Db 2161 AT GGCAAAAGTT GAACAGC C AGT G C CTGAT CATT CTGAGCT AGT T GAAGAT T C CTC AC CT 2220 

Qy 726 AspSerGluProValAspLeuPheSerAspAspSerlleProAspValProGlnLysGln 745 

II I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I 

Db 2221 GATTCT GAAC CAGTT GACT T ATTT AGTGAT GATT CAAT AC CT GACGT T C C ACAAAAACAA 2280 

Qy 746 AspGluThrValMetLeuValLysGluSerLeuThrGluThrSerPheGluSerMetlle 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2281 GAT GAAACT GTGAT GCT T GT GAAAGAAAGT CT CACT GAGACTTCAT T T GAGT CAAT GAT A 2340 

Qy 766 GluTyrGluAsnLysGluLysLeuSerAlaLeuProProGluGlyGlyLysProTyrLeu 785 

I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 2341 GAAT AT GAAAATAAGGAAAAACT CAGTGCT TT GC C AC CT GAGGGAGGAAAGCCAT AT TT G 2400 

Qy 786 GluSerPheLysLeuSerLeuAspAsnThrLysAspThrLeuLeuProAspGluValSer 805 

II II I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I 

Db 24 01 GAATCTTTTAAGCTCAGTTTAGATAACACAAAAGAT ACCCT GTTACCT GATGAAGTTTCA 2460 

Qy 806 ThrLeuSerLysLysGluLysIleProLeuGlnMetGluGluLeuSerThrAlaValTyr 825 

I I I I I I I H I I I I I I I I I I I I I I I I I I I I I I I I I ! I 1 I I I ! I I I I I I I I I I I I I I I I I I I 
Db 2461 ACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAGGAGCT CAGTACT GCAGTTTAT 2520 

Qy 826 SerAsnAspAspLeuPhelleSerLysGluAlaGlnlleArgGluThrGluThrPheSer 845 

I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2521 TCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAAACTGAAACGTTTTCA 2580 

Qy 846 AspSerSerProIleGluIlelleAspGluPheProThrLeuIleSerSerLysThrAsp 865 

I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I 
Db 2581 GAT T C AT CT C CAATT GAAATT AT AGATGAGTT C C CT ACAT T GAT CAGT T CTAAAACT GAT 2640 

Qy 866 SerPheSerLysLeuAlaArgGluTyrThrAspLeuGluValSerHisLysSerGluIle 885 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2641 T CATTTTCTAAATTAGCCAGGGAATATACT GACCTAGAAGTATCCCACAAAAGT GAAATT 2700 

Qy 886 AlaAsnAlaProAspGlyAlaGlySerLeuProCysThrGluLeuProHisAspLeuSer 905 

I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II II I II I I I I I I I I 
Db 2701 GCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCCCATGACCTTTCT 2760 

Qy 906 LeuLysAsnlleGlnProLysValGluGluLysIleSerPheSerAspAspPheSerLys 925 

I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I II I I I I I I I I I 
Db 2761 TTGAAGAACATACAACCCAAAGTT GAAGAGAAAATCAGTTTCTCAGAT GACTTTTCTAAA 2820 

Qy 926 AsnGlySerAlaThrSerLysValLeuLeuLeuProProAspValSerAlaLeuGlyHis 945 

I I II II II I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I II I I I 

Db 2821 AATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCTGCTTTGGCC 2877 

Qy 94 6 ThrGlnAlaGluIleGluSerlleValLysProLysValLeuGluLysGluAlaGluLys 965 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I 

Db 2878 ACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAAGAAGCTGAGAAA 2937 

Qy 966 LysLeuProSerAspThrGluLysGluAspArgSerProSerAlallePheSerAlaAsp 985 

I I I I I I I I M I II I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II M : : : 
Db 2938 AAACTT CCTT CCGATACAGAAAAAGAGGACAGATCACCAT CT GCTATATTTTCAGCAGAG 2997 

Qy 986 LeuGlyLysThrSerValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyVal 1005 

III I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I II 



Db 2998 CT GAGTAAAACTT C AGTT GTT GACCT CCT GT ACT GGAGAGACATTAAGAAGACT GGAGT G 3057 

Qy 1006 ValPheGlyAlaSerLeuPheLeuLeuLeuSerLeuThrValPheSerlleValSerVal 1025 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I 
Db 3058 GTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTA 3117 

Qy 1026 ThrAlaTyrlleAlaLeuAlaLeuLeuSerValThrlleSerPheArglleTyrLysGly 1045 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3118 ACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGT 3177 

Qy 1046 VallleGlnAlalleGlnLysSerAspGluGlyHisProPheArgAlaTyrLeuGluSer 1065 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I II I I I II I I I II I I I I I I I 
Db 3178 GTGAT CCAAGCTAT CCAGAAATCAGATGAAGGCCACCCATTCAGGGCAT ATCTGGAATCT 3237 

Qy 1066 GluValAlalleSerGluGluLeuValGlnLysTyrSerAsnSerAlaLeuGlyHisVal 1085 

I I I I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
■ Db 3238 GAAGTT GCT AT AT CT GAGGAGT T GGT TCAGAAGT ACAGTAATT CT GCT CTT GGT CAT GT G 3297 

Qy 1086 AsnCysThrlleLysGluLeuArgArgLeuPheLeuValAspAspLeuValAspSerLeu 1105 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | 
Db 3298 AACT GCAC GATAAAGGAACT CAGGC GCCTCT T CT T AGTT GATGAT TT AGT T GAT T CT CT G 3357 

Qy 1106 LysPheAlaValLeuMetTrpValPheThrTyrValGlyAlaLeuPheAsnGlyLeuThr 1125 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3358 AAGT TT GCAGT GT T GAT GT GGGT AT TT ACCT AT GT T GGT GCCT T GTTTAAT GGT CT GACA 3417 

Qy 1126 LeuLeuIleLeuAlaLeuIleSerLeuPheSerValProVallleTyrGluArgHisGln 1145 

I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 3418 CTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 3477 

Qy 1146 AlaGlnlleTVspHisTyrLeuGlyLeuAlaAsnLysAsnValLysAspAlaMetAlaLys 1165 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 3478 GC GCAGATAGAT C ATT AT CT AGGACTT GCAAATAAGAAT GTTAAAGAT G CT AT G GCTAAA 3537 

Qy 1166 IleGlnAlaLysIleProGlyLeuLysArgLysAlaGlu 1178 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3538 AT CCAAG CAAAAAT C CCT GGATT GAAGCGCAAAG CT GAA 3576 
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Alignment Scores : 



Pred. No.: 


0 


Length: 


3579 


Score : 


5810.00 


Matches : 


1159 


Percent Similarity: 


97,49% 


Conservative : 


4 


Best Local Similarity: 


97.15% 


Mismatches : 


14 


Query Match: 


98.09% 


Indels: 


16 


DB: 


9 


Gaps : 


3' 



US-09-830-972-29 (1-1178) x US-09-893-348-22 (1-3579) 



Qy 

Db 



1 MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 20 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
1 ATGGAAGACCTGGACCAGTCTCCTCTGGTCTCGTCCTCGGACAGCCCACCCCGGCCGCAG 60 



Qy 

Db 



21 ProAlaPheLysTyrGlnPheValArgGluProGluAspGluGluGluGluGluGluGlu 40 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II I I I I 
61 C CC GCGTT CAAGTAC CAGTT CGT GAGGGAGC CC GAGGACGAGGAGGAAGAAGAGGAGGAG 120 



Qy 

Db 



41 GluGluGluAspGluAspGluAspLeuGluGluLeuGluValLeuGluArgLysProAla 60 
I I II II I I I I I I 1 I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
121 GAAGAGGAGGAC GAGGACGAAGACCT GGAGGAGCT GGAGGT GCT GGAGAGGAAGC C C GC C 18 0 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 AlaGlyLeuSerAlaAlaProValProThrAlaProAlaAlaGlyAlaProLeuMetAsp 80 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I 

181 GCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTGATGGAC 240 

81 PheGlyAsnAspPheValProProAlaProArgGlyProLeuProAlaAlaProProVal 100 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

241 TTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCCCCCGTC 300 

101 AlaProGluArgGlnProSerTrpAspProSerProValSerSerThrValProAlaPro 120 

I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 GCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCCGCGCCA 360 

121 SerProLeuSerAlaAlaAlaValSerProSerLysLeuProGluAspAspGluProPro 140 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 TCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAGCCTCCG 420 



Qy 

Db 



141 
421 



AlaArgProProProProProProAlaSerValSerProGlnAlaGluProValTrpThr 

I II I I M I I I I II I I I I I I I I II I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
GCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTGTGGACC 



160 
480 



161 ProProAlaProAlaProAlaAlaProProSerThrProAlaAlaProLysArgArgGly 180 

M I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I ! I I I I I I I I I I I I 
481 CCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGCAGGGGC 540 

181 SerSerGlyAla ■ ■ ValVal 186 

MINIMI::: III::: 
541 TCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCTGTGATA 600 

187 ************LysIleMetAspLeuLysGluGlnProGlyAsnThrIleSerAlaGly 206 

:: : II I II I I II I II I II M II I II I I I I M II II II I II II I I I 
•601 CGCTCCTCTGCAGAAAATATGGACTTGAAGGAGCAGCCAGGTAACACTATTTCGGCTGGT 660 

207 GlnGluAspPheProSerValLeuLeuGluThrAlaAlaSer***ProSerLeuSerPro 226 

I I II I II I I I I I I I I M I I II I II I II I I II II I I I M II I I I I I I I I I I I I II I I I 
661 CAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCTCTGTCTCCT. 720 

227 LeuSerAlaAlaSerPheLysGluHisGluTyrLeuGlyAsnLeuSerThrValLeuPro 246 

I II I I I II I II I I I I M I I II I II I II I I I II I I II I II I I I .M II M II I II M II I I I 
721 CT CTCAGCC GCT TCTTT CAAAGAAC AT GAAT ACCTT GGTAATTT GT CAAC AGT AT T AC CC 780 

247 ThrGluGlyThrLeuGlnGluAsnValSerGluAlaSerLysGluValSerGluLysAla 266 

I I I II I I I I I I II II I I I I II II II II II I II I I II I M I II M I II I I I I I I II I II I I 

781 ACT GAAGGAAC ACTT CAAGAAAAT GT C AGT GAAGCTT CTAAAGAGGT CT CAGAGAAGGCA 840 

2 67 LysThrLeuLeuIleAspArgAspLeuThrGluPheSerGluLeuGluTyrSerGluMet 2 86 

II II M I I I I I I II II I I I I I I I I I I I I I I I I I I II I II II I I I I II I I I I I I II II I I I 

841 AAAACT CT ACT CAT AGAT AGAGAT TTAAC AGAGTT TT CAGAATT AGAAT ACTCAGAAAT G 900 

287 GlySerSerPheSerValSerProLysAlaGluSerAlaVallleValAlaAsnProArg 306 

II I I II II II I I I II II I II I I I I II I I I II I I I II II I I I II I II I I I I I I II M II I I 

901 GGAT CAT CGTT C AGT GT CT CT C CAAAAGCAGAAT CT GC C GTAAT AGT AGCAAAT C CT AGG 960 

GluGluIlelleValLysAsnLysAspGluGluGluLysLeuValSerAsnAsnlleLeu 326 

I I II M I I I I I I M I I II I II I I I I II I I II II M I II I II I I I I I II I Ml I II I II I I 
GAAGAAATAAT C GT GAAAAATAAAGAT GAAGAAGAGAAGTT AGTT AGT AATAAC AT CCTT 1020 

327 His***GlnGlnGluLeuProThrAlaLeuThrLysLeuValLysGluAspGluValVal 346 

III I I I II I I I I I I I I I I II I I II M II I II II II II I I I I II II I I I I II M II I I 
1021 C ATAAT CAACAAGAGTT ACCT ACAGCT CT T ACTAAATT GGTT AAAGAGGAT GAAGTT GT G 1080 

347 SerSerGluLysAlaLysAspSerPheAsnGluLysArgValAlaValGluAlaProMet 366 

II II I I I I II I I I I II I II I I II I I I II I I I I I I II II II I I I I I I I I I I II II II II I I 
1081 T CTT C AGAAAAAGCAAAAGACAGT TTT AAT GAAAAGAGAGT T GC AGT GGAAGC T C CT AT G 1140 

367 ArgGluGluTyrAlaAspPheLysProPheGluArgValTrpGluValLysAspSerLys 386 

I II I I I II I I I I II I I I II I I II M II II I I I I II I II I I M M Ml M M I I I I I II I I 
1141 AGGGAGGAAT AT GCAGACTT CAAACC AT TT GAG CGAGT AT GGGAAGT GAAAGAT AGT AAG 1200 

387 GluAspSerAspMetLeuAlaAlaGlyGlyLysIleGluSerAsnLeuGluSerLysVal 406 

II II II II I II II II M II II II II I II I II II I II II II I I I I II I II I I I II II M II 
1201 GAAGAT AGT GAT AT GTT GGCT GCT GGAGGT AAAAT CGAGAGCAACTT GGAAAGT AAAGT G 1260 

407 AspLysLysCysPheAlaAspSerLeuGluGlnThrAsnHisGluLysAspSerGluSer 426 
I I II M I I I I II I I I I I I I II I II I I II I I II M I M I I I I II II II I II I I I II I II II 
1261 GAT AAAAAAT GT TTT GCAGAT AGC CT TGAGCAAACTAAT C AC GAAAAAGAT AGT GAGAGT 1320 



Qy 


427 


SerAsnAspAspThrSerPheProSerThrProGluGlylleLysAspArgSerGlyAla 


446 






1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1321 


AGTAATGAT GAT ACTT CT T T CC C CAGT ACGC C AGAAGGT ATAAAGGAT CGTC C AGGAGCA 


1380 


Qy 


447 


TyrlleThrCysAlaProPheAsnProAlaAlaThrGluSerlleAlaThrAsnllePhe 


466 






1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




.Db 


1381 


TATATCACAT GTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT GCAACAAACATTTTT 


1440 


Qy 


467 


ProLeuLeuGlnAspProThrSerGluAsn***ThrAspGluLysLysIleGluGluLys 


486 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 
■ ■ ■ ■ ■ ■ i i i i i i i i i i i i t i i i i t i i i i i i ) i i i i i i i 1 i i i i i } i i i i i i i i i 




Db 


1441 


C CT TTGTT AGGAGAT CCT ACTT CAGAAAATAAGAC C GAT GAAAAAAAAAT AGAAGAAAAG 


1500 


Qy 


487 


LysAlaGlnlleValThrGluLysAsnThrSerThrLysThrSerAsnProPhePheVal 


506 






1 1 1 1 I 1 1 i 1 i 1 1 1 1 i 1 1 1 1 1 1 If 1 It 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 III 
i i i i i i i i i i i i i i i i i i i i i i t i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i iti 




Db 


1501 


AAGGCC CAAAT AGTAACAGAGAAGAAT ACT AGC ACCAAAAC AT CAAAC CCTT TT CTT GT A 


1560 


Qy 


507. 


AlaAlaGlnAspSerGluThrAspTyrValThrThrAspAsnLeuThrLysValThrGlu 


526 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 
i i i i i i i i i i i i i i 1 i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i t i i i 




Db 


1561 


GCAGCAC AGGATT CT GAGACAGATT AT GT C ACAACAGAT AATT TAACAAAGGT GACT GAG 


1620 


Qy 


527 


GluValValAlaAsnMetProGluGlyLeuThrProAspLeuValGlnGluMaCysGlu 


546 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 I 1 II 1 1 1 1 ! 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 




Db 


1621 


GAAGTCGT GGCAAACAT GC CT GAAGGC CT GACT CCAGATT T AGT ACAGGAAGCAT GT GAA 


1680 


Qy 


547 


SerGluLeuAsnGluValThrGlyThrLysIleAlaTyrGluThrLysMetAspLeuVal 


566 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i t i i i i I i 




Db 


1681 


AGT GAATT GAAT GAAGT T ACT GGT ACAAAGAT T GCTTAT GAAACAAAAAT GGACTT GGTT 


1740 


Qy 


567 


GlnThrSerGluValMetGlnGluSerLeuTyrProAlaAlaGlnLeuCysProSerPhe 


586 






1 1 1 1 1 1 1 1 1 II ! 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 11 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i t i i i i i i i i i i i 




Db 


1741 


CAAACAT CAGAAGTTATGCAAGAGTCACTCTAT CCTGCAGCACAGCTTTGCCCATCATTT 


1800 


Qy 


587 


GluGluSerGluAlaThrProSerProValLeuProAspIleValMetGluAlaProLeu 


606 






1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 11 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 11 

1 1 1 1 1 1 1 1 1 i 1 1 1 ! 1 1 I 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1801 


GAAGAGT CAGAAGCT ACT CCTT C AC C AGTTTT GCCT GACATT GTT AT GGAAGCACCATT G 


1860 


Qy 


607 


AsnSerAlaValProSerAlaGlyAlaSerVallleGlnProSerSerSerProLeuGlu 


626 






1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i f i i i i i i i i t i i i i 1 i i i i i i t i i i i i i i i i i i i i i t 1- 1 i i 




Db 


1861 


AATT CT GCAGTT CCT AGT GCT GGT GCT T CCGT GAT ACAGCCC AGCT CAT CAC CAT T AGAA 


1920 


Qy 


627 


AlaSerSerValAsnTyrGluSerlleLysHisGluProGlxiAsnProProProTyrGlu 


646 






1 1 1 II 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 ! 1 I 1 I 1 1 1 1 1 1 1 1 1 1 1 i 1 t 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 | 1 | | | [ | 1 | | | I | | | 




Db 


1921 


GCTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAACCCCCCACCATATGAA 


1980 


Qy 


647 


GluAlaMetSerValSerLeu LysValSerGlylleLysGluGluIleLysGluPro 


665 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i * i i i i i i i i i i i i i i i i t i i t i i i i i i i i i 




Db 


i981 


GAGGCCAT GAGT GT ATC ACT AAAAAAAGT AT CAGGAATAAAGGAAGAAATTAAAGAGCCT 


2040 


Qy 


666 


GluAsnlleAsnAlaAlaLeuGlnGluThrGluAlaProTyrlleSerlleAlaCysAsp 


685 






1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2041 


GAAAAT ATTAAT G C AGCT CTT CAAGAAAC AGAAGCT C CTT AT AT AT CT AT T G CAT GT GAT 


2100 


Qy 


686 


LeuIleLysGluThrLysLeuSerAlaGluProAlaProAspPheSerAspTyrSerGlu 


705 






1 II 1 II 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2101 


TTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTCTCTGATTATTCAGAA 


.2160 


Qy 


706 


MetAlaLysValGluGlnProValProAspHisSerGluLeuValGluAspSerSerPro 


725 



I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I ! I i I I I I II I I I I I I I I I I I 
Db 2161 AT GGCAAAAGT T GAACAGC C AGT GC CT GAT CATT CT GAGCT AGTTGAAGATT CCT CACCT 2220 

Qy 726 AspSerGluProValAspLeuPheSerAspAspSerlleProAspValProGlnLysGln 745 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 2221 GATTCT GAAC C AGTT GACT T ATTT AGT GAT GATT CAAT AC CTGAC GT T C CACAAAAACAA 2280 

Qy 746 AspGluThrValMetLeuValLysGluSerLeuThrGluThrSerPheGluSerMetlle 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I 
Db 2281 GAT GAAACT GT GAT GCTTGT GAAAGAAAGT CT CACT GAGACTT C ATT T G AGT CAAT GAT A 2340 

Qy 766 GluTyrGluAsnLysGluLysLeuSerAlaLeuProProGluGlyGlyLysProTyrLeu 785 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 2341 GAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGAGGAAAGCCATATTTG 24 00 

Qy 786 GluSerPheLysLeuSerLeuAspAsnThrLysAspThrLeuLeuProAspGluValSer 805 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M II II I I I I I I I I I I I I 1.1 I I I I 
Db 2401 GAAT CT TTTAAGCT CAGTT T AGATAAC ACAAAAGAT ACCCT GTT AC CT GAT GAAGTTT CA 2460 

Qy 806 ThrLeuSerLysLysGluLysIleProLeuGlnMetGluGluLeuSerThrAlaValTyr 825 

I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 2461 AC ATTGAGCAAAAAGGAGAAAAT T C CTTT GC AGAT GGAGGAGCT CAGT ACT GCAGT TT AT 2520 

Qy 826 SerAsnAspAspLeuPhelleSerLysGluAlaGlnlleArgGluThrGluThrPheSer 845 

I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2521 T CAAAT GAT GACT TAT T TAT TT CTAAGGAAGC ACAGATAAGAGAAACT GAAACGTT TT CA 2580 

Qy 846 AspSerSerProIleGluIlelleAspGluPheProThrLeuIleSerSerLysThrAsp 865 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I 
Db 2581 GATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATCAGTTCTAAAACTGAT 2640 

Qy 866 SerPheSerLysLeuAlaArgGluTyrThrAspLeuGluValSerHisLysSerGluIle 885 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2641 T CATTTT CTAAATT AGCC AG GGAAT AT ACT GAC CT AGAAGT AT CCCACAAAAGT GAAATT 2700 

Qy 886 AlaAsnAlaProAspGlyAlaGlySerLeuProCysThrGluLeuProHisAspLeuSer 905 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 2701 GCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCCCATGACCTTTCT 2760 

Qy 906 LeuLysAsnlleGlnProLysValGluGluLysIleSerPheSerAspAspPheSerLys 925 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I 
Db 2761 TTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCT CAGATGACTTTT CTAAA 2820 

Qy 926 AsnGlySerAlaThrSerLysValLeuLeuLeuProProAspValSerAlaLeuGlyHis 945 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2821 AATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCTGCTTTGGCC 2877 

Qy 946 ThrGlnAlaGluIleGluSerlleValLysProLysValLeuGluLysGluAlaGluLys 965 

I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I 
Db 2878 ACTCAAGC AGAGAT AGAGAGC AT AGT TAAACC CAAAGT T CTTGT GAAAGAAGCT GAGAAA 2937 

Qy 966 LysLeuProSerAspThrGluLysGluAspArgSerProSerAlallePheSerAlaAsp 985 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I : : : 
Db 2938 AAACTT CCTT CCGATACAGAAAAAGAGGACAGAT CACCAT CTGCT ATATTTTCAGCAGAG 2997 

Qy 986 LeuGlyLysThrSerValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyVal 1005 

III I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 



36 


703 


38. 


0 


1520 


37 


699 


37. 


8 


1473 


38 


695 


37. 


6 


422 


39 


678 


36. 


6 


1766 


40 


678 


36. 


6 


1766 


41 


678 


36. 


6 


1766 


42 


677 


36. 


6 


1915 


43 


. 674 


36. 


4 


1668 


44 


674 


36. 


4 


2768 


45 


672 


36. 


3 


422 



15 US-10-084-817-333 
15 US-10-205-194-128 

9 US-09-960-352-8477 

10 US-09-809-391-254 
10 US-09-882-17 1-254 
17 US-10-164-861-254 

17 US-10-27 6-774- 980 
9 US-09-765-205-25 

18 US-10-723-8 60-68 67 
9 US-09-960-352-11567 



Sequence 333, App 
Sequence 128, App 

Sequence 8477, Ap 
Sequence 254 , App 
Sequence 254 , App 
Sequence 254, App 
Sequence 980, App 

Sequence 25, Appl 
Sequence 6867 , Ap 

Sequence 11567, A 



ALIGNMENTS 

RESULT 1 
US^O.9^2.8 9^q&6^5 

^Sequence 5, J^>plication US/09789386 

;F dfcent Nu. ~lTS20020010324Al 

; GENERAL INFORMATION:- 

; APPLICANT: ^CHALOVI CH , DAVI D ^ 

; APPLICANT: PRINJHA, "'kAHTNDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,386 

; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 
; PRIOR FILING DATE: 1999-07-19 
; PRIOR APPLICATION NUMBER: U.K. 9816024.5 
; PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359,208 
; PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 5 
; LENGTH: 1122 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-789-386-5 



Alignment Scores : 



Pred. No . : 


5.13e-152 


Length: 


1122 


Score: 


1829.00 


Matches: 


360 


Percent Similarity: 


96.51% 


Conservative : 


0 


Best Local Similarity: 


96.51% 


Mismatches : 


1 


Query Match: 


98.86% 


Indels : 


12 


DB: 


9 


Gaps : 


1 



SEQ29 (1-361) x US-09-789-386-5 (1-1122) 



Qy 1 MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 20 

) I I M I I I I I I I I I I I I I I II I I I I I I I | | | | | | | | | | 1 | | | | | | | M I I I I I I I I I I I I 
Db 1 ATGGAAGACCTGGACCAGTCTCCTCTGGTCTCGTCCTCGGACAGCCCACCCCGGCCGCAG 60 

Qy 21 ProAlaPheLysTyrGlnPheValArgGluProGluAspGluGluGluGluGluGluGlu 40 

I I I I I I I I M I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 C CCGC GTT CAAGT ACCAGTT C GT GAGGGAGC CC GAGGACGAGGAGGAAGAAGAGGAGGAG 120 



Qy 


41 


Db 


121 


Qy 


61 


Db 


181 


Qy. 


81 


Db 


241 


Qy 


101 


Db 


301 


Qy 


121 


Db 


361 


Qy 


141 


Db 


421 


Qy 


161 


Db 


481 


Qy 


174 


Db 


541 


Qy 


189 


Db 


601 


Qy 


209 


Db 


661 


Qy 


229 


Db 


721 


Qy 


249 


Db 


781 


Qy 


269 


Db 


841 


Qy 


289 


Db 


901 


Qy 


309 



GluGluGluAspGluAspGluAspLeuGluGluLeuGluValLeuGluArgLys ProAla 60 
I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I | | | | | | | | | | | | | i | 
GAAGAGGAGGACGAGGAC GAAGACCT GGAGGAGCT GGAGGTGCT GGAGAGGAAGCCC GC C 180 

AlaGlyLeuSerAlaAlaProValProThrAlaProAlaAlaGlyAlaProLeuMetAsp 80 
I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I M | | | | | | | | | | I I I I I I I I I I I I I 
GCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTGATGGAC 240 

PheGlyAsnAspPheValProProAlaProArgGlyProLeuProAlaAlaProProVal 100 
I M I I I I I I M I I M I I I I I I I I j I I I | | | | | ! | | | ] | | | | [ | | E | j | | | m I I I M I I 1 
TTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCCCCCGTC 300 

AlaProGluArgGlnProSerTrpAspProSerProValSerSerThrValProAlaPro 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
GCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCCGCGCCA 360 

SerProLeuSerAlaAlaAlaValSerProSerLysLeuProGluAspAspGluProPro 140 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAGCCTCCG 420 

AlaArgProProProProProProAlaSerValSerProGlnAlaGluProValTrpThr 160 
I I I I M I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
GCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTGTGGACC 4 80 

ProProAlaProAlaProAlaAlaProProSerThrSer 173 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

CCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGCAGGGGC 54 0 

: -ValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyVal 188 

i I I I I I I I I M It I I I I I I I I I I I I I I I I I I I I [ | I I I I I I i I I I 
T CCT CGGGCT CAGT GGTT GTT GACCT C CT GTACTGGAGAGACATT AAGAAGACT GGAGT G 600 

ValPheGlyAlaSerLeuPheLeuLeuLeuSerLeuThrValPheSerlleValSerVal 208 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
GT GTTT GGT GC CAGC CT ATT C CT GCT GCTT TCATT GACAGT ATT C AGCATT GT GAGC GT A 660 



I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I I 
ACAGCCT AC AT TGC CTT GGCC CT GCT CT C T GT GAC CAT C AGCTTT AGGAT AT ACAAGGGT 720 



M I I I I II I I I I I M I I I I I I I I I I I I I I I I M I I | | | | | | | | | | | | | | | M I I I I I I I I 

GT GATC CAAGCT AT C CAGAAAT CAGAT GAAGGC CAC C CAT T C AGGGC AT AT CT GGAAT CT 780 



I I N I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

GAAGTTGCTATATCT GAGGAGTTGGTT CAGAAGTACAGTAATTCT GCT CTT GGT CATGT G 840 

AsnCysThrlleLysGluLeuArgArgLeuPheLeuValAspAspLeuValAspSerLeu 288 
M HI I M I I I I I I I I I I I I II I I | | | | | | | | | | | | | | | | | | | | | | | | M | | | | | || 1| | 
AACT GCAC GATAAAGGAACT CAGGC GCCT CTT CTT AGT T GAT GATT T AGTT GAT TC T CT G 900 



M I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I 

AAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACA 960 

309 LeuLeuIleLeuAlaLeuIleSerLeuPheSerValProVallleTyrGluArgHisGln 328 



Db 



961 



M M III I I II III MM! II Mill II II I II II Ml || M | || Mill II II I II II I 
CTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 1020 



QY 329 AlaGlnlleAspHisTyrLeuGlyLeuAlaAsnLysAsnValLysAspAlaMetAlaLys 34 8 

I I I I I I M I II II I II I I I I I II I II I I II I II II I M II I I I II I I I I I I M I I I I II I 
Db 1021 GC AC AGAT AGAT CATT AT CTAGGACTT GCAAAT AAGAAT GTTAAAGAT GCT AT GGCTAAA 1080 

Qy 349 IleGlnAlaLysIleProGlyLeuLysArgLysAlaGlu 361 

I I I I M I II I I I II II I I I I I II I I I I II I I II II II I I 
Db 1081 AT CC AAGCAAAAAT CC CT GGATT GAAGCGCAAAGCT GAA 1119 



RESULT 2 
US-09-765-205-5 

Sequence 5, Application US/09765205 
Patent No. US20020034 800A1 
GENERAL INFORMATION: 
APPLICANT: Cao, Li 

TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 
FILE REFERENCE: 1458.004/200130.449 
CURRENT APPLICATION NUMBER: US/09/765, 205 
CURRENT FILING DATE: 2001-01-17 
PRIOR APPLICATION NUMBER: US/ 09/212 , 44 6 
PRIOR FILING DATE: 1998-12-16 
NUMBER OF SEQ ID NOS : 46 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 5 
LENGTH: 1610 
TYPE: DNA 
ORGANISM: human 
US-09-765-205-5 



Alignment Scores : 
Pred. No. : 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 



8.25e-152 


Length: 


1610 


1829.00 


Matches : 


360 


96.51% 


Conservative: 


0 


96.51% 


Mismatches : 


1 


98.86% 


Indels : 


12 


9 


Gaps : 


1 



SEQ29 


(1-361) 


Qy 


1 


Db 


132 


Qy 


21 


Db 


192 


Qy 


41 


Db 


252 


Qy 


61 


Db 


312 



x US-09-765-205-5 (1-1610) 

MetGluAspLeuAspGlnSerProLeuValSerSerSerAspSerProProArgProGln 20 
I I I I I I I I M 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I | 
ATGGAAGACCTGGACCAGTCTCCTCTGGTCTCGTCCTCGGACAGCCCACCCCGGCCGCAG 191 



ProAlaPheLysTyrGlnPheValArgGluProGluAspGluGluGluGluGluGluGlu 
I I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I || || I I | 
C CC GCGT T CAAGT AC CAGTT C GT GAGGGAGCC C GAGGAC GAGGAGGAAGAAGAGGAGGAG 

GluGluGluAspGluAspGluAspLeuGluGluLeuGluValLeuGluArgLysProAla 
I M I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 
GAAGAGGAG GACGAGGAC GAAGACCT GGAGGAGCT GGAG GT GCT GGAGAGGAAGC CCGC C 



M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

GCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTGATGGAC 



40 



251 



60 



311 



80 



371 



Db 



3565 AAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTG 3624 



Qy 



Db 



121 LeuMetTrpValPheThrTyrValGlyAlaLeuPheAsnGlyLeuThrLeuLeuIleLeu 140 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II I I I I I I I II I I I I I I I I 11 1 I I I I 
3625 TTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTG 3684 



Qy 



141 AlaLeuIleSerLeuPheSerValProVallleTyrGluArgHisGlnAlaGlnlleAsp 160 
I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



3685 GCT CT C ATT TCACT CTTC AGT GT T CCT GTT AT TT AT GAAC GGCAT C AGGCACAGAT AGAT 3744 



Qy 



Db 



161 HisTyrLeuGlyLeuAlaAsnLysAsnValLysAspAlaMetAlaLysIleGlnAlaLys 180 

I I I I I I I I II I I I II I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
3745 CATTATCTAGGACTT GCAAATAAGAAT GTTAAAGAT GCTATGGCTAAAATCCAAGCAAAA 3804 



Qy 



Db 



181 IleProGlyLeuLysArgLysAlaGlu 189 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3805 AT C CCT GGATT GAAGCGCAAAGCT GAA 3831 



RESULT 2 



US- 08-2110-607 -2 



Sequence 2^)Application US/08700607 



Patent No. 5858708 
GENERAL INFORM* 




APPLICANT/: Bandman, oT^a 



APPLICANT T — ftii" Young, .Tamrce 



APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals , Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/700 , 607 

FILING DATE: Filed Herewith 
ATTORNEY/ AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER; 36,749 

REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 799 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 



MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-2 

Alignment Scores: 

Pred. No.: 4.77e-120 Length: 799 

Score: 927.00 Matches: 188 

Percent Similarity: 100.00% Conservative: 0 

Best Local Similarity: 100.00% Mismatches: 0 

Query Match: 99.57% Indels: 0 

DB: 2 Gaps: 0 

US-09-830-972-29_COPY_990_1178 (1-189) x US-08-700-607-2 (1-799) 

Qy 2 ValValAspLeuLeuTyrTrpArgAspIleLysLysThrGlyValValPheGlyAlaSer 21 

I I I I I I I 1 1 I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I t I I I I I I t 1 I I I I I I ! I 
Db 108 GT T GTT GAC CT C CT GT ACT GGAGAGAC ATTAAGAAGACT GGAGT GGT GTTTGGT GC CAGC 167 

Qy 22 LeuPheLeuLeuLeuSerLeuThrValPheSerlleValSerValThrAlaTyrlleAla 41 

I I I I 1 I I I I I I I I I I I ! I I I I 11 I I II I I I I I I I I I I I I I I II I I I 1 I I I I I I I I I I I I I 
Db 168 CT AT TC CT GCT GCTT TCATT GACAGTATT C AGCAT T GT GAGCGTAAC AGC CT AC AT T GCC 227 

Qy 42 LeuAlaLeuLeuSerValThrlleSerPheArglleTyrLysGlyVallleGlnAlalle 61 

I I I II I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 228 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 287 

Qy 62 GlnLysSerAspGluGlyHisProPheArgAlaTyrLeuGluSerGluValAlalleSer 81 

I I I I I I I I I I I I I I I I I I I I I U I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I 
Db 288 C AGAAAT C AGAT GAAGGCCAC C C ATT C AGGGCAT AT CT GGAAT CT GAAGT T GCT AT AT CT 347 

Qy 82 GluGluLeuValGlnLysTyrSerAsnSerAlaLeuGlyHisValAsnCysThrlleLys 101 

I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 348 GAGGAGTT GGTT C AGAAGT AC AGTAATT CT GCT CTT GGT CAT GT GAACT GCAC GATAAAG 407 

Qy 102 GluLeuArgArgLeuPheLeuValAspAspLeuValAspSerLeuLysPheAlaValLeu 121 

I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I 
Db 408 GAACTCAGGC GC CT CTT CT T AGTT GAT GATT T AGTT GATT CT CT GAAGTT T GCAGT GTT G 467 

Qy 122 MetTrpValPheThrTyrValGlyAlaLeuPheAsnGlyLeuThrLeuLeuIleLeuAla 141 

I I I I I I I I I I II I I I II I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I II I I I I I I I II 
Db 468 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 527 

Qy 142 LeuIleSerLeuPheSerValProVallleTyrGluArgHisGlnAlaGlnlleAspHis 161 

I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 528 CT C ATTT C ACT CT T C AGT GTT C CT GT T ATT TAT GAAC GGC AT CAGGCAC AGAT AGAT CAT 587 

Qy 162 TyrLeuGlyLeuAlaAsnLysAsnValLysAspAlaMetAlaLysIleGlnAlaLysIle 181 

I II I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I II I I II I I I I I I II I I I I I I I II I 
Db 58 8 TAT CT AGGACTT GCAAATAAGAAT GTTAAAGAT GCT AT GGCTAAAATC CAAGC AAAAAT C 647 

Qy 182 ProGlyLeuLysArgLysAlaGlu 189 

I II I I I I I I I I I I I I I I I I I I II I 
Db 648 C CT GGAT T GAAGC GCAAAGCT GAA 671 
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